Wideband multiple access telecommunication method and apparatus

ABSTRACT

The present invention is related to a system and method for wideband multiple access telecommunication. In the method, a block is transmitted from a basestation to a terminal. The block comprises a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol. In the terminal, at least two independent signals that comprise at least a channel distorted version of the transmitted block are generated. The two independent signals are combined with a combiner filter with filter coefficients which are determined by using the pilot symbol, thus a combined filtered signal is obtained. The combined filtered signal is despread and descrambled with a composite code of the basestation specific scrambling code and one of the user specific codes.

RELATED APPLICATIONS

This application claims priority under 35 U.S.C. 119(e) from U.S. Provisional Patent Application No. 60/286,486, filed Apr. 26, 2001, and which is incorporated by reference.

FIELD OF THE INVENTION

The invention relates to telecommunication methods and systems, more in particular Wideband Code Division Multiple Access (WCDMA) for wireless communications.

DESCRIPTION OF RELATED TECHNOLOGY

Wideband direct sequence code division multiple access (WCDMA) is emerging as the predominant wireless access mode for forthcoming 3G systems, because it offers higher data rates and supports larger number of users over mobile wireless channels compared to conventional access techniques like TDMA and narrowband CDMA [1]. Demodulation of WCDMA signals in multi-path channels is conventionally achieved with a Maximum Ratio Combining (MRC) RAKE receiver [2]. Although the RAKE receiver is optimal for single-user multi-path channels, multi-user interference (MUI) severely limits its performance in a multi-user setting. Moreover, the MUI is enhanced by the near/far situation in the uplink, where the data signal of a desired farby user can be overwhelmed by the data signal of an interfering nearby user.

Multi-user detection techniques, that use joint code, timing and channel information, alleviate the near/far problem and offer significant performance improvement compared to the RAKE receiver [3]. Linear symbol-level multi-user equalizer receivers, that implicitly depend on the instantaneous code-correlation matrix, have been extensively studied in [4], [5], [6], [7], [8] and [9]. Practical adaptive implementations of these receiver techniques require a constant code-correlation matrix in order to guarantee convergence of the adaptive updating algorithm and therefore call for the use of short spreading codes (the code period equals the symbol period) [10]. Although well-suited for implementation in the basestation (the standard foresees a mode with short spreading codes for the uplink [11]), these techniques can not be used in the mobile station since the WCDMA downlink employs long spreading codes (the code period is much longer than the symbol period).

In the WCDMA downlink, the symbol streams of the different active users are multiplexed synchronously to the transmission channel by short orthogonal spreading codes that are user specific and a long overlay scrambling code that is base-station specific. The MUI is essentialy caused by the channel, since the different user signals are distorted by the same multi-path channel when propagating to the mobile station of interest. Moreover, power control in the downlink enhances the MUI and creates a far/near situation where the data signal of a desired nearby user is overwhelmed by the data signal intended for a farby user. Linear chip-level equalization, introduced in [12], can restore the orthogonality of the user signals and suppress the MUI. Ideal Zero-Forcing (ZF) and Minimum Mean Squared Error (MMSE) chip-level equalizer receivers have also been investigated in [13], [14], [15] and [16]. They consist of a linear chip-leve equalizer mitigating the inter-chip interference (ICI), followed by a descrambler/despreader and a decision device.

However, adaptive implementations of such chip-level equalizer receivers that can track time-varying multi-path channels are hard to realize in practice because of two reasons. On the one hand, sending a training chip sequence at regular time instants in correspondence with the coherence time of the channel, would reduce the system's spectral efficiency. On the other hand, using blind techniques that do not require any training overhead, would reduce the system's performance. Nevertheless, most current proposals use blind algorithms to determine the equalizer coefficients. One method applies a standard single-user blind channel identification algorithm and employs the obtained channel estimate to calculate the equalizer coefficients [17]. Another blind approach, pursued in [18] and [19] makes use of the fact that, in the absence of noise, the received signal after equalization should lie in the subspace spanned by the user codes.

GB 2362 075 A describes a Wideband Code Division Multiple Access (WCDMA) method, wherein according to FIG. 4 therein, from the received signal time-delayed versions are made, each of each time-delayed versions are correlated with each of the codes available within the communication system. Said correlated time-delayed signals are then each filtered with a user specific and signal dependent filter. Said filtered correlated time-delayed signals are then finally decimated and combined. It is important to note that the step of time-delaying, correlating (here denoted despreading) and filtering happens at chip-rate signals.

SUMMARY OF CERTAIN INVENTION ASPECTS

One embodiment of the invention provides in a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising transmitting a block from a basestation to a terminal, the block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, and performing in the terminal generating a plurality of independent signals having at least a channel distorted version of the transmitted block, combining the plurality of independent signals with a combiner filter with filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal and despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific spreading codes.

Another embodiment of the invention provides in a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising transmitting a block from a basestation to a terminal, the block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols, which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, transmitting projection information, enabling projecting of a signal on the orthogonal complement of the space spanned by the composite codes of the basestation specific scrambling code and the user specific codes, and performing in the terminal generating at least two independent signals comprising at least a channel distorted version of the transmitted block, combining with a combiner filter with filter coefficients determined by using the projection information being determined on the signals, thus obtaining a combined filtered signal and despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific codes.

Another embodiment of the invention provides a receiver system for wireless communication, comprising a receiver configured to receive a block comprising a plurality of chip symbols which are data symbols spread by using scrambling and spreading codes and at least one pilot symbol, at least one combining filter circuit configured to receive at least two independent signals derived from the received block and output a multi-user interference suppressed signal, and at least one despreading circuit configured to despread the multi-user interference suppressed signal with one of the scrambling and spreading codes.

Another embodiment of the invention provides a receiver system for wireless communication, comprising a receiver configured to receive a block comprising a plurality of chip symbols which are data symbols spread by using scrambling and spreading codes, and projection information, enabling projecting of a signal on the orthogonal complement of the space spanned by scrambling and spreading codes, at least one combining filter circuit configured to input at least two independent signals derived from the received block and output a multi-user interference suppressed signal, and at least one despreading circuit configured to despread the multi-user interference suppressed signal with one of the scrambling and spreading codes.

Another embodiment of the invention provides a receiver system for wireless communication, comprising J combining filter circuits, each of the combining filter circuits receiving a plurality of independent signals and outputing a multi-user interference suppressed signal, a circuit configured to determine the coefficients of the combining filters circuit, a plurality of despreading circuits configured to despread the J multi-user interference suppressed signals and a combining circuit configured to combine the J multi-user interference suppressed signals.

Still another embodiment of the invention provides in a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising receiving a block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, generating at least two independent signals based on the received block, and combining the at least two independent signals with a combiner filter having filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal.

Still another embodiment of the invention provides a receiver system for wireless communication, comprising means for receiving a block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, means for generating at least two independent signals based on the received block and means for combining the at least two independent signals with a combiner filter having filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal.

Yet another embodiment of the invention provides a method of retrieving a desired user data symbol sequence from a received signal, the method comprising receiving a channel modified version of a transmitted signal comprising a plurality of user data symbol sequences, each being encoded with a user specific known code, determining an equalization filter directly and in a deterministic way from the received signal, and applying the equalization filter on the received signal to thereby retrieve the transmitted signal.

Yet another embodiment of the invention provides a system for retrieving a desired user data symbol sequence from a received signal, comprising means for receiving a channel modified version of a transmitted signal comprising a plurality of user data symbol sequences, each being encoded with a user specific known code, means for determining an equalization filter directly and in a deterministic way from the received signal, and means for applying the equalization filter on the received signal to thereby retrieve the transmitted signal.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a telecommunication system in a single-cell configuration.

FIG. 2 illustrates a block diagram of a telecommunication system according to one embodiment of the invention.

FIG. 3 illustrates a detailed block diagram of the telecommunication system shown in FIG. 2.

FIG. 4 illustrates a block diagram of a telecommunication system according to another embodiment of the invention.

FIG. 5 illustrates a telecommunication system in a multiple-cell configuration.

FIG. 6 illustrates a flowchart that explains the operation of a telecommunication system according to one embodiment of the invention.

FIG. 7 illustrates a flowchart that explains the operation of a telecommunication system according to another embodiment of the invention.

FIG. 8 illustrates a block diagram of a transmitter model of the j-th base-station according to one embodiment of the invention.

FIG. 9 illustrates a block diagram of a receiver model of the mobile station of interest according to one embodiment of the invention.

FIG. 10 illustrates a chart showing block processing for small block length B=30 according to one embodiment of the invention.

FIG. 11 illustrates a chart showing block processing for large block length B=60 according to one embodiment of the invention.

FIG. 12 illustrates a chart showing block processing for small interference power P_(i)=−10 dB according to one embodiment of the invention.

FIG. 13 illustrates a chart showing block processing for large interference power P_(i)=+10 dB according to one embodiment of the invention.

FIG. 14 illustrates a chart showing block processing for full system load K=11 according to one embodiment of the invention.

FIG. 15 illustrates a chart showing block processing for exponential power delay profile according to one embodiment of the invention.

FIG. 16 illustrates a chart showing adaptive processing for half system load K=5 according to one embodiment of the invention.

FIG. 17 illustrates a chart showing adaptive processing for full system load K=11 according to one embodiment of the invention.

FIG. 18 illustrates a chart showing block processing algorithms for time-multiplexed pilot: J=1 base-station and half system load according to one embodiment of the invention.

FIG. 19 illustrates a chart showing block processing algorithms for time-multiplexed pilot: J=1 base-station and full system load according to one embodiment of the invention.

FIG. 20 illustrates a block diagram of a data model and user-specific detection part of pilot-aided adaptive chip equalizer receiver according to one embodiment of the invention.

FIG. 21 illustrates a block diagram of a pilot-aided updating part of pilot-aided chip equalizer receiver according to one embodiment of the invention.

FIG. 22 illustrates a comparison chart of fractionally-spaced versus symbol-spaced adaptive chip equalizer receiver in quarter system load according to one embodiment of the invention.

FIG. 23 illustrates a comparison chart of fractionally-spaced versus symbol-spaced adaptive chip equalizer receiver in half system load according to one embodiment of the invention.

FIG. 24 illustrates a comparison chart of fractionally-spaced versus symbol-spaced adaptive chip equalizer receiver in full system load according to one embodiment of the invention.

FIG. 25 illustrates a chart showing influence of Doppler-spread on performance of adaptive chip equalizer receiver according to one embodiment of the invention.

FIG. 26 illustrates a flowchart that explains the operation of a telecommunication system according to another embodiment of the invention.

DETAILED DESCRIPTION OF CERTAIN INVENTIVE EMBODIMENTS

A. System Overview

In the invention, methods for communication, more in particular, wireless communication, between devices and the related devices are presented (FIG. 1). In said method at least data (10) is transmitted from at least one basestation (100) to at least one terminal (200). Said communication method is extendable to a case with a plurality of basestations (FIG. 5, 100, 110), each basestation being designed for covering a single cell (FIG. 5, 20, 21) around such basestation. In such multiple basestation and hence multicell case a terminal receives typically signals from both the most nearby basestation and other basestations. Within the method it is assumed that the basestation has at least one antenna (110) and the terminal also has at least one physical antenna (210). The communication between the basestation(s) and the terminal is designed such that said communication is operable in a context with multiple terminals. Hence it is assumed that substantially simultaneous communication between said basestation(s) and a plurality of terminals is happening, while still be able to distinguish at the terminal side which information was intended to be transmitted to a dedicated terminal. The notion of a user is introduced. It is assumed that with each terminal in such a multi terminal context at least one user is associated. The invented communication method and related devices exploit spreading with orthogonal codes as method for separating information streams being associated with different users. Hence at the basestation side information, more in particular data symbols, of different users, hence denoted user specific data symbols are available. After spreading spread user specific data symbols are obtained. These spread user specific data symbols are added, leading to a sum signal of spread user specific data symbols. Further additional scrambling of said sum signal is performed by a scrambling code being basestation specific. Symbols obtained after spreading or spreading and scrambling, and summing are denoted chip symbols. In the invented communication method blocks with a plurality of said chip symbols are transmitted. In a single basestation case the transmitted signal thus comprises of a plurality of time overlapped coded signals, each coded signal being associated to an individual user and distinguishable only by a user specific encoding, based on the user signature or spreading codes. In a multiple basestation context, the distinguishing also exploits said basestation specific code. Further such blocks have at least one pilot symbol (also called training sequence and denoted by s_(p)[i] in equation (46)) being predetermined and known at both sides of the transmission link. In the invented method the availability of a receiver (FIG. 2, 220) being capable of generating at least two independent signals (310) from a received signal is assumed. Said receiver receives a spread-spectrum signal, corresponding to a superposition of the signals of all users active in the communication system or link, more in particular said superposition of signals is channel distorted. Said generation of at least two independent signals can be obtained by having either at least two antennas at the terminal, each independent signal being the signal (300) received at such antenna after the typical down-converting and filtering steps. In case of a single antenna terminal, polarization diversity [20] of said single antenna can be exploited or the temporal oversampling of the transmitted signal can be used. In all case these independent signals are channel distorted versions (caused by the time-dispersive nature of the multi-path channel and described by equation (47)) of the signal transmitted by the basestation(s), including the transmitted block of chip symbols. Alternatively formulated it can be stated that the receiver or front-end circuitry provides for samples, typically complex baseband samples for a plurality of channels (either via different antennas or via polarisation diversity or oversampling) in digital format.

Recall that the invention exploits spreading with orthogonal codes for separating different users. Unfortunately said channel distortion is destroying the orthogonality of the used codes, leading to a bad separation. This problem is denoted multi-user interference. Hence the invention concerns a method for retrieving a desired user's symbol sequence (denoted by s_(k)[i] in equation (46)), from a received signal transmitted in a communication context with multi-user interference. In said method a step of inputting or receiving said received signal, being a channel distorted version of a transmitted signal comprising a plurality of user data symbol sequences, each being encoded with a user specific known code is found.

It is an aspect of the invention to provide a method which suppresses the multi-user interference by performing operations on the chip symbols, hence before despreading and descrambling. This multi-user interference suppression is obtained by performing combining of said independent signals resulting in a combined filtered signal (320). In an embodiment of the invention said combining, also denoted chiplevel equalization, is a linear space-time combining. For said combining (chip-level equalizer) a combiner filter (230) is used. The (chip-level equalization) filter coefficients (420) of said combiner filter are determined directly from said independent signals (310), hence without determining an estimate of the channel characteristic. One can state that from said independent signals (310) in a direct and deterministic way a chiplevel equalization filter is determined. Said chip-level equalization filter is such that said transmitted signal is retrieved when applying said filter to said received signal.

More in particular said filter coefficients are determined by using said independent signals and the knowledge of the pilot symbol. Finally despreading and descrambling (240) with a code being the composite of said basestation specific scrambling code and one of said user specific codes (denoted by c_(k)[n] in, equation (46)) is applied on said combined filtered signal (320) to obtain at least of one of said user specific data symbols. Said last step thus retrieves the desired user data symbol by correlating said retrieved transmitted signal (320) with the desired user specific known code and basestation specific code.

Note that within the despreading and descrambling step steps of performing correlations of modified received spread spectrum signals against selected replica of a spreading sequence are done, here with modified is meant linear space-time combining. Further said despreading step is used to extract the user corresponding to the spreading code used, more in particular the replica of the spreading code used at the transmitting side.

Alternatively said method for detection of a spread-spectrum signal can be formulated as comprising the steps of receiving a spread-spectrum signal including a training sequence, said signal corresponds to a superposition of signals of users; generating a first spread spectrum signal derived from said received spread spectrum signal and a second spread spectrum signal derived from said received spread spectrum signal; directly calculating the coefficients of a filter using knowledge of said training sequence from said derived spread-spectrum signals; and at least passing said first spread-spectrum signal and said second spread-spectrum signal different from said first spread-spectrum signal through said filter to generate a combined spread-spectrum signal; and despreading the filtered spread-spectrum signal, to extract a given user.

Within the invention said generated signals are independent, meaning received through another channel, either by having multiple antenna's, using the polarisation diversity of the available antennas or using temporal oversampling. Said generated signals are not a time-delayed version of each other. The combining of said generated signals is done in order to correct the signal for multi-user interference, more in particular to suppress multi-user interference. The combining filter is independent of the user one finally wants to detect. The combined signal is only then despread, meaning within the context of the description, correlated and decimated. The correlating is performed only with the user specific code of the user one wants to detect. Within the invention the combining is done before correlating steps.

The method of detection of a user signal, thus comprises the steps of generating independent signals, filtering said generated independent signals with a filter being independent of said user, said filtering results in a combined signal; and despreading and descrambling with a code being user specific. Alternatively it can be stated that the method of detection of a user signal, involves in the detection part, only the code being user specific. The combining is done at chip-rate signals, hence before any step of despreading or even correlating is made. For the sake of clarity it should be understood that in some embodiment in the part for determining the combining filter, thus not in the detection part itself, despreading is done as a first step before the filter is determined. However this despreading is done with the pilot code only. Within the method (FIG. 6) one can thus distinguish a filter determination part, denoted also initialisation, and a detection part using said filter. The method of filter determination despreads said generated independent signals with the pilot code while in the detection part despreading is performed on said filtered signal with the user specific code. It should be also understood that in some other embodiment in the part for determining the combining filter, thus not in the detection part itself, projecting rather than despreading is done as a first step before the filter is determined. However, this projecting is based on the multi-user code correlation matrix. Within the method (FIG. 7) one can thus distinguish a filter determination part, denoted also initialisation, and a detection part using said filter. The method of filter determination projects said generated independent signals using a projection matrix derived from the multi-user code correlation matrix while in the detection part despreading is performed on said filtered signal with the user specific code.

In a first embodiment of said method said performing of combining (FIG. 3, 230) comprises the steps of performing filtering, preferable digital linear filtering (231) on said signals (310) to obtain the filtered signals (315) and combining (232), preferably linearly, said filtered signals to obtain said combined filtered signal (320). The filters used in said combining step have filter coefficients (420) being determined in a filter coefficient determination circuit (400) and are being determined by using said pilot symbol, more in particular by directly using said pilot symbol, hence the fact that said pilot symbol is known at the terminal side is exploited. Alternatively formulated, said (filter) coefficients are determined without (explicit) channel estimation (the action of explicitly determining the coefficients of the Finite Impulse Response (FIR) channel filter that models the underlying multi-path propagation channel). Note said twostep combining (230) is used for suppressing multi-user interference, hence restoring the orthogonality of the codes. Note that the chip-level equalization (filter coefficient) determination step does not rely on a-priori estimation of the channel characteristics. Also signal statistics are not used within the chiplevel equalization determination step, hence the notion of deterministic determination of said coefficients can be used.

In a second embodiment of said method the direct filter coefficient determination is disclosed, indicating that said filter coefficients are determined such that one version or function of the combined filtered signal is as close as possible to a version or function of the pilot symbol. With as close as possible is meant according to a norm defined on signals, for instance a two-norm, resulting in Least Squares (LS) sense minimization. The LS minimization can also be recursified leading to Recursive Least Squares (RLS) minimization. Other minimization approaches such as Least Mean Squares (LMS) are also possible.

B. Code Division Multiplexed Pilot Based Embodiment

In a first embodiment of said direct filter coefficient determination a communication method is presented, denoted code division multiplexed pilot based determination, wherein for each user a user specific code is available but at least one of said user specific codes is used for spreading a known signal, denoted the pilot symbol. The associated user specific code is therefore denoted the pilot code. Hence in said transmitted block a pilot symbol being spread by a pilot code is found.

B.1 Raining Based Subembodiment

In a first subembodiment thereof a so-called training-based approach is used, relying on the knowledge of said pilot symbol. More in particular said direct filter determination is such that the used version of the combined filtered signal is the combined filtered signal after despreading with a code being the composite code of said basestation specific scrambling code and said pilot code (denoted by c_(p)[n] in equation (46) while said version of said pilot symbol is said pilot symbol itself. It should be noted however, that in practice said independent signals are first despread before combining takes place (see FIG. 6).

In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the composite pilot codes, a Hankel structure output matrix, involving the received signals.

B.2 Semi-Blind Based Subembodiment

In a second subembodiment thereof a so-called semi-blind based approach is used, relying on the knowledge of said pilot symbol but also on characteristics of the codes. More in particular said direct filter determination is such that the used version of the combined filtered signal is the combined filtered signal after projecting on the orthogonal complement on the subspace spanned by the composite codes, being the codes composed of said base station specific scrambling code and said user specific codes, and said version of the pilot symbol is said pilot symbol spread with a composite code of said base station specific scrambling code and said pilot code. It should be noted however, that in practice said independent signals are first projected before combining takes place (see FIG. 7).

In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the pilot codes, a Hankel structure output matrix, involving the received signals and a matrix involving the user codes. Hence one can state that said chip-level equalization determination step exploits said known pilot symbol sequence, said pilot specific known code(s) and substantially all (active) user specific known codes.

C. Time Division Multiplexed Pilot Based Embodiment

In a second embodiment of said direct filter coefficient determination a communication method is presented, denoted time division multiplexed pilot based method, wherein for each user a user specific code is available but in said block of symbols a plurality of a known pilot symbols are found, for each user code at least one. One can state that said transmitted block comprises at least two pilot symbols, each being spread by a user specific code. Hence said pilot symbols can be denoted user specific known pilot symbols (represented by s_(k)[i] in equation (64 )), each being encoded with a user specific known code.

C.1 Training Based Subembodiment

In a first subembodiment of said second embodiment a so-called training-based approach is used, relying on the knowledge of said pilot symbol. More in particular said direct filter determination is such that the used version of the combined filtered signal is the combined filtered signal after despreading with a code being the composite code of said basestation specific scrambling code and a user specific code and said version of said pilot symbol is said pilot symbol itself. It should be noted however, that in practice said independent signals are first despread before combining takes place (see FIG. 6).

In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the pilot codes and a Hankel structure output matrix.

C.2 Semi-blind Based Subembodiment

In a second subembodiment of said second embodiment a so-called semi-blind based approach is used, relying on the knowledge of said pilot symbol but also on characteristics of the codes. More in particular said direct filter determination is such that the used version of the combined filtered signal is the pilot part of the combined filter signal; and said version of the pilot symbol is the sum of the spread pilot symbols within said transmitted block. In said method further said filter coefficients are determined such that the data part of the combined filter signal after projecting on the orthogonal complement on the subspace spanned by the composite codes, being the codes composed of said base station specific scrambling code and said user specific codes being as close as possible to the zero subspace. It should be noted however, that in practice said data part of said independent signals are first projected before combining takes place (see FIG. 7).

In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the pilot codes, a Hankel structure output matrix, involving the received signals and a matrix involving the user codes. Hence one can state that said chiplevel equalization determination step exploits substantially all (active) user specific known codes.

D. Multiple Base Station Embodiment

In a third embodiment of the invention,as shown in FIG. 5, at least J base stations (100, 110) are transmitting to said terminal having at least an integer number M at least J+1/(St*Sp) physical antenna's with St the temporal oversampling factor and Sp the polarisation diversity order. Note that alternatively less than M antenna's can be used by trading off complexity versus performance.

E. Multiple Base Station Multi-Antenna Terminal Embodiment

In a fourth embodiment of the invention within said terminal at least M=J+1 independent signals (310) are generated from the receiver (220) with J the amount of basestations. Within said terminal said steps of space-time combining (230) is performed J times as shown in FIG. 4. Each of the resulting combined filtered signals (320) are multi-user interference suppressed versions of the signal transmitted from one of said basestations. Finally said despreading and descrambling (240) is performed J times as shown in FIG. 4, to obtain J intermediate soft estimates (330) of a user specific data symbol. Subsequently a step of combining (250), preferably linearly, of said J despread combined signals (330) to obtain a final soft estimate (340) of a user specific data symbol is performed followed by determining (260) from said final soft estimate a hard estimate (350) for said user specific data symbol.

F. Wireless Semi-Blind Embodiment

In a second aspect of the invention a wireless semi-blind communication method wherein besides user data symbols and pilot symbols, additional information is transmitted from the basestation(s) to said terminals. Said additional information (430), received by the receiver block (220), is used in the filter coefficient determination (400). More in particular said additional information is projection information, enabling projecting of said independent signals on the orthogonal complement of the subspace spanned by the composite codes of said basestation specific scrambling code and said user specific codes. Hence the space-time combiner filter has filter coefficients being determined by using said projection information. Depending on the length of the user specific codes N a maximal amount of users, being N in case of time division multiplexed pilots and N−1 in case of code division multiplexed pilots. In practice an amount of active user Ua less than said maximal amount are active.

In a first embodiment of said second aspect said projection information is constructed by using or considering the active user's.

In a second embodiment of said second aspect said projection information is constructed by using or considering a predetermined set of assumed active user's, the size of said set being another upperbound on the amount of active users allowed.

In a third aspect of the invention an apparatus with at least one antenna (210) comprising (i) a receiver (220) capable of receiving a block (300) comprising of a plurality of chip symbols being data symbols spread by using scrambling and spreading codes and at least one pilot symbol, (ii) at least one linear space-time combining filter circuit (230), inputting at least two independent signals (310) derived from said received block and outputting a multi-user interference suppressed signal (320); (iii) a circuit (400) for determination of said combining filter circuit's coefficients (420), said filter circuit's coefficients being determined (directly) by using said pilot symbol; and (iv) at least one despreading circuit (240), used for despreading with one of said scrambling and spreading codes said multi-user interference suppressed signal (320). Alternatively formulated said apparatus can be denoted a radio receiver comprising a front-end circuitry (220) for providing complex baseband samples for a plurality of channels in digital format; an integrated adaptive digital combining circuitry (230), connected to said front-end circuitry, for combining said complex baseband samples; and a digital component (240), connected to said combining circuitry, being arranged to perform correlations of received spread spectrum signals against selected replica offsets of a spreading sequence.

Alternatively formulated said apparatus for detecting of data signals from a transmitted signal in a Code-Division Multiple Access (CDMA) communication system, wherein the transmitted signal comprising a plurality of time overlapping coded signals, each coded signal associated to an individual user and distinguishable only by a user specific encoding (signature, spreading sequences), comprising a convolutional coder having at least two independent signals being representative for the transmitted signal as an input and outputting a single vector being representative for said transmitted signal; a decoder having said single vector as an input and outputting symbol information for at least one individual user.

In a fourth aspect of the invention an apparatus with at least one antenna (210) comprising (i) a receiver (220) capable of receiving a block comprising of a plurality of chip symbols being data symbols spread by using scrambling and spreading codes, and projection information, enabling projecting of said independent signals on the orthogonal complement of the subspace spanned by scrambling and spreading codes; (ii) at least one linear space-time combining filter circuit (230), inputting at least two independent signals (310) derived from said received block and outputting a multi-user interference suppressed signal, (iii) a circuit (400) for determination of said combining filter circuit's coefficients with filter circuit's coefficients (420) being determined by using said projection information (430), and (iv) at least one despreading circuit (240), used for despreading with one of said scrambling and spreading codes said multi user interference suppressed signal (320).

In a fifth aspect of the invention an apparatus comprising a plurality J of linear space-time combining filter circuits (230), each of said combining filter circuits inputting at least J+1 independent signals (310) and outputting a multi user interference suppressed signal (320), a circuit (400) for determination of said combining filters circuit's coefficients (420), a plurality of J despreading circuits (240), used for despreading said J multi-user interference suppressed signals (320); and a linear combining circuit (250–260) for combining said J spread multi-user interference suppressed signals.

The last combining circuit can comprise of a substep of determining a soft estimate (250) and a substep of determining a hard estimate (260) based on said soft estimate.

Alternatively formulated said apparatus can be denoted a radio receiver comprising a front-end circuitry (220) for providing complex baseband samples for a plurality of channels in digital format; a plurality of integrated adaptive digital combining circuitry (230), connected to said front-end circuitry, for combining said complex baseband samples; and a digital component (240), connected to said combining circuitry, being arranged to perform correlations of received spread spectrum signals against selected replica offsets of a spreading sequence.

Alternatively said apparatus for detection of a spread-spectrum signal can comprise of a processor arranged to provide a plurality of multiple input combining filter circuits (230), the plurality of multiple input combining filter circuits being respectively coupled to a plurality of despreading circuits (240).

Alternatively formulated said apparatus for detecting of data signals from a transmitted signal in a Code-Division Multiple Access (CDMA) communication system, wherein the transmitted signal comprising a plurality of time overlapping coded signals, each coded signal associated to an individual user and distinguishable only by a user specific encoding (signature, spreading sequences), comprising a plurality of convolutional coders having at least two independent signals being representative for the transmitted signal as an input and outputting a single vector being representative for said transmitted signal; a decoder having said single vector as an input and outputting symbol information for at least one individual user.

Said apparatus further comprises an algorithm unit (400), said algorithm unit is coupled to said plurality of combining filter circuits. In an embodiment said algorithm unit is an adaptive algorithm unit. The algorithm unit is used for calculating the coefficients of said combining filters based upon a training sequence within said spread-spectrum signal, more in particular exploiting a priori knowledge of the training sequence at the detection apparatus side of the communication link.

Before describing the embodiment of the invention more in detail one should note that the block transmitted from said at least one basestation to an at least one terminal comprises of a plurality of chip symbols scrambled with a base station specific scrambling code, said plurality of chip symbols comprising a plurality of spread user specific data symbols, being user specific data symbols spread by using user specific spreading codes and at least one pilot symbol.

Alternatively formulated in a single-cell concept (meaning a single basestation) the transmitted signal can be a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific aperiodic scrambling code. The user aperiodic code sequence is the multiplication of the corresponding spreading code and the base-station specific aperiodic scrambling code. In a code division multiplexed pilot approach the multi-user chip sequence (denoted by x[n] in equation (46)) comprises of K user signals plus at least one continuous pilot signal. In a time division multiplexed pilot approach the multi-user chip sequence (denoted by x[n] in equation (64)) comprises of a first set of user symbols, also denoted data symbols and a second set of known pilot symbols.

Alternatively formulated in a multiple cell concept (meaning a plurality of basestations) the transmitted signal of each basestation can be a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific aperiodic scrambling code. The user terminal then receives a sum of essentially all said transmitted signals. The user aperiodic code sequence is the multiplication of the corresponding spreading code and the base-station specific aperiodic scrambling code. In a code division multiplexed pilot approach the multi-user chip sequence comprises of K user signals plus at least one continuous pilot signal. In a time division multiplexed pilot approach the multi-user chip sequence comprises of a first set of user symbols, also denoted data symbols and a second set of known pilot symbols.

Below various embodiments of the invention are described. The invention is not limited to these embodiments but only by the scope of the claims.

G. Embodiment

G.1 Data Model for Multi-cell WCDMA Downlink

G.1.a Multi-channel framework. We consider the downlink of a multi-cell WCDMA system with J active base-stations, transmitting to the mobile station of interest (soft handover mode). Each base-station transmits a synchronous code division multiplex, employing short orthogonal Walsh-Hadamard spreading codes that are user specific and a long overlay scrambling code that is base-station specific. As shown in FIG. 8, the multi-user chip sequence, transmitted by the j-th base-station x^(j)[n], consists of K_(j) active user signals and a continuous pilot signal:

$\begin{matrix} {{x^{j}\lbrack n\rbrack} = {{\sum\limits_{k = 1}^{K_{j}}{{s_{k}^{j}\lbrack i\rbrack}{c_{k}^{j}\left\lbrack {n\mspace{11mu}{mod}\mspace{11mu}\rho\; N} \right\rbrack}}} + {{s_{p}^{j}\lbrack i\rbrack}{\underset{p}{\overset{j}{c}}\left\lbrack {n\mspace{11mu}{mod}\mspace{11mu}\rho\; N} \right\rbrack}}}} & (1) \end{matrix}$ with

$i = {\left\lfloor \frac{n}{N} \right\rfloor.}$

The data symbol sequence s_(k) ^(j)[i] corresponds to the Dedicated Physical Data CHannel (DPDCH) for the k-th user from the j-th base-station whereas the pilot symbol sequence s_(p) ^(j)[i] corresponds to the j-th base-station's Common PIlot CHannel (CPICH) in the UTRA specification for 3G systems [11]. For notational simplicity, we assume that the power of a symbol sequence is incorporated in the symbol sequence itself Each user's data symbol sequence s_(k) ^(j)[i] (pilot symbol sequence s_(p) ^(j)[i]) is spread by a factor N with the length-ρN user composite code sequence c_(k) ^(j)[n] (pilot composite code sequence c_(p) ^(j)[n]). The k-th user's composite code sequence for the j-th base-station c_(k) ^(j)[n] (pilot composite code sequence c_(p) ^(j)[n]) is the multiplication of the user specific short Walsh-Hadamard spreading code ć_(k) ^(j)[n] (pilot-specific Walsh-Hadamard spreading code ć_(p) ^(j)[n]) and the base-station specific long scrambling code c_(s) ^(j)[n].

Assume that the mobile station is equipped with M receive antennas and let h_(m) ^(j)(t) denote the continuous-time channel from the j-th base-station to the m-th receive antenna, including the transmit and receive filters. The received signal at the m-th receive antenna can then be written as:

${y_{m}(t)} = {{\sum\limits_{j = 1}^{J}{\sum\limits_{n^{\prime} = {- \infty}}^{+ \infty}{{h_{m}^{j}\left( {t - {n^{\prime}T_{c}}} \right)}{x^{j}\left\lbrack n^{\prime} \right\rbrack}}}} + {e_{m}(t)}}$ with e_(m)(t) the continous-time additive noise at the m-th receive antenna. By sampling the received signal at the chiprate T_(c), we obtain the following received sequence at the m-th receive antenna:

${y_{m}\lbrack n\rbrack} = {{y_{m}\left( {nT}_{c} \right)} = {{\sum\limits_{j = 1}^{J}{\sum\limits_{n^{\prime} = {- \infty}}^{+ \infty}{{h_{m}^{j}\left\lbrack n^{\prime} \right\rbrack}{x^{j}\left\lbrack {n - n^{\prime}} \right\rbrack}}}} + {e_{m}\lbrack n\rbrack}}}$ with e_(m)[n] the discrete-time additive noise at the m-th receive antenna and h_(m) ^(j)[n]=h_(m) ^(j)(nT_(c)) the discrete-time channel from the j-th base-station to the m-th receive antenna. Stacking the received samples obtained from the M receive antennas:

$\begin{matrix} {{{y\lbrack n\rbrack} = \left\lbrack {{y_{1}\lbrack n\rbrack}\mspace{20mu}{y_{2}\lbrack n\rbrack}\mspace{14mu}\ldots\mspace{14mu}{y_{M}\lbrack n\rbrack}} \right\rbrack^{T}}\;\text{we can write:}} & \; \\ {{y\lbrack n\rbrack} = {{\sum\limits_{j = 1}^{J}{\sum\limits_{n^{\prime} = {- \infty}}^{+ \infty}{{h^{j}\left\lbrack n^{\prime} \right\rbrack}{x^{j}\left\lbrack {n - n^{\prime}} \right\rbrack}}}} + {e\lbrack n\rbrack}}} & (2) \end{matrix}$ where e[n] is similarly defined as y[n] and h^(j)[n] is the discrete-time M×1 vector channel from the j-th base-station to the M receive antennas, given by: h^(j)[n]=[h₁ ^(j)[n]h₂ ^(j)[n] . . . h_(M) ^(j)[n]]^(T) Note that we model h^(j)[n] as an M×1 FIR vector filter of order L_(j) with delay index δ_(j) (h^(j)[n]≠0, for n=δ_(j) and n=δ_(j)+L_(j), and h^(j)[n]=0, for n<δ_(j) and n>δ_(j)+L_(j)).

Until now, we have considered chip rate sampling at the receiver side. Since in practice the transmit and receive filters are root raised cosine filters for a rate

$\frac{1}{T_{c}}$ with a bandwidth somewhat higher than

$\frac{1}{T_{c}},$ we should actually sample at a rate that is higher than

$\frac{1}{T_{c}}.$ This is called temporal oversampling. Sampling at S_(t) times the chip rate, the data model described in Equation 65 would become a multi-channel model with MS_(t) diversity channels per base-station. Similar to temporal oversampling, polarization diversity can increase as well the number of diversity channels per base-station [21]. Sampling at S_(t) times the chip rate and applying S_(p)-fold polarization diversity, the data model described in Equation 65 would become a multi-channel model with MS_(t)S_(p) diversity channels per basestation instead of merely M. In practice a temporal oversampling factor of S_(t)=2 suffices. Since in real-life only the horizontal and the vertical polarization can be exploited, the polarization diversity order is limited to S_(p)=2. For simplicity reasons, we will only consider chip rate sampling and 1-fold polarization diversity at the receiver side. Note however that all future discussions remain valid if we replace M by MS_(t)S_(p).

G.1.b Data model for block processing. Let us now introduce the following (Q+1)M×BN output matrix Y_(a);

$\begin{matrix} {Y_{a} = \begin{bmatrix} {y\lbrack a\rbrack} & {y\left\lbrack {a + 1} \right\rbrack} & \cdots & {y\left\lbrack {a + {BN} - 1} \right\rbrack} \\ \vdots & \vdots & \; & \vdots \\ {y\left\lbrack {a + Q} \right\rbrack} & {y\left\lbrack {a + Q + 1} \right\rbrack} & \cdots & {y\left\lbrack {a + Q + {BN} - 1} \right\rbrack} \end{bmatrix}} & (3) \end{matrix}$ where B is the block length, a is the processing delay and Q+1 is the temporal smoothing factor. This output matrix can be written as:

Y a = ∑ j = 1 J ⁢ i ⁢ X a j + E a ( 4 ) where the noise matrix E_(a) is similarly defined as Y_(a) and the j-th base-station's (Q+1)M×r_(j) (r_(j)=L_(j)+1+Q) channel matrix H^(j) with block Toeplitz structure is given by:

j = [ h j ⁡ [ δ j + L j ] ⋯ h j ⁡ [ δ j ] 0 ⋯ 0 0 h j ⁡ [ δ j + L j ] ⋯ h j ⁡[ δ j ] ⋯ 0 ⋰ ⋰ 0 ⋯ 0 h j ⁡ [ δ j + L j ] ⋯ h j ⁡ [ δ j ] ] The j-th base-station's r_(j)×BN input matrix X_(a) ^(j) is given by:

$X_{\alpha}^{j} = \left\lbrack \begin{matrix} x_{\alpha - \delta_{j} - L_{j}^{T}}^{i} & \cdots & \left. x_{\alpha - \delta_{j} + Q^{T}}^{j} \right\rbrack^{T} \end{matrix} \right.$ where the multi-user chip sequence vector x_(a) ^(j), transmitted by the j-th base-station, starting at delay a is defined by: x _(a) ^(j) =[x ^(j) [a]x ^(j) [a+1] . . . x ^(j) [a+BN−1]]  (5) Note that Equation 4 can also be written as: Y _(a) =HX _(a) +E _(a)  (6) where H is the (Q+1)M×r channel matrix, given by: H=[H¹ . . . H^(J)]  (7) and X_(a) is the following r×BN input matrix: X_(a)=[X_(a) ¹ ^(T) . . . X_(a) ^(J) ^(T) ]^(T)  (8)

$r = {\sum\limits_{j = 1}^{J}r_{j}}$ is called the system order. In order to guarantee the existence of Zero-Forcing (ZF) chip-level equalizers, we make the following rather standard assumptions about the data model:

Assumption 1: The channel matrix H has full column rank r.

Assumption 2: The input matrix X_(a) has full row rank r.

The first assumption requires that (Q+1)M≧r, which is equivalent with:

$\begin{matrix} {{\left( {Q + 1} \right)\left( {M - J} \right)} \geq {\sum\limits_{j = 1}^{J}L_{j}}} & (9) \end{matrix}$ Therefore, in order to simultaneously track J base-stations the mobile station should be equipped with at least M=J+1 receive antennas for chip rate sampling. However, when we use a temporal oversampling factor of S_(t)=2 and we exploit S_(p)=2-fold polarization diversity at the receiver, we obtain MS_(t)S_(p)=4 (good for soft handover between J=3 base-stations) with only M=1 receive antenna at the mobile station.

The second assumption, on the other hand, requires that: BN≧r  (10) which states that the number of observed chip samples BN should be larger than the system order r. G.2 Space-time Block Chip-level Equalizer Receivers

In this section, we develop four space-time block chip-level equalizer receivers for the mobile station operating in soft handover mode: the pilot-aided block RAKE receiver, the fully-trained, the pilot-trained and the enhanced pilot-trained block chip-level equalizer receiver. The receivers can detect the desired user's data symbols from each of the J active base-station signals. They address blocks of B symbols at once and, as shown in FIG. 9, consist of J parallel branches, one for each base-station connected to the mobile station of interest. Each branch is made up of a linear space-time chip-level equalizer followed by a correlator. The space-time chip-level equalizer of the j-th branch linearly combines the discrete-time signals from the different antennas and tries to restore the multi-user chip sequence, transmitted by the j-th base-station. The chip-level equalizer receivers differ in the amount of a-priori information they assume to determine their equalizer coefficients. The correlator of the j-th branch descrambles and despreads the equalized signal with the desired user's composite code sequence for the j-th base-station. The final soft decisions about the desired user's data symbols are obtained by combining the correlator outputs of the different branches. These soft decisions are then input to a decision device that generates the final hard decisions.

G.2.a Preliminary definitions. The multi-user chip vector transmitted by the j-th base-station x₀ ^(j), starting at delay a=0, consists of two parts: a first part corresponding to the unknown data symbols of the different DPDCHs and a second part corresponding to the known pilot symbols of the CPICH. Using Equation 1 and 67, we can write the 1×BN multi-user chip vector, transmitted by the j-th base-station, starting at delay a=0, as follows: x ₀ ^(j) =s _(d) ^(j) C _(d) ^(j) +s _(p) ^(j) C _(p) ^(j)  (11) where s_(d) ^(j) is the j-th base-station's 1×K_(j)B multi-user data symbol vector that stacks the data symbol vectors of the different active users served by the j-th base-station: s_(d) ^(j)=[s₁ ^(j) . . . s_(K) _(j) ^(j)] and s_(k) ^(j) is the 1×B data symbol vector that stacks the data symbols of the k-th user served by the j-th base-station: s_(k) ^(j)=[s_(k) ^(j)[0]s_(k) ^(j)[1] . . . s_(k) ^(j)[B−1]] The 1×B transmitted pilot symbol vector of the j-th base-station s_(p) ^(j) is similarly defined as s_(k) ^(j). The K_(j)B×BN multi-user composite code matrix for the j-th base-station stacks the composite code matrices of the different active users: C_(d) ^(j)=[C₁ ^(j) ^(T) . . . C_(K) _(j) ^(j) ^(T) ]^(T) where C_(k) ^(j) is the B×BN composite code matrix of the k-th user served by the j-th base-station

$C_{k}^{j} = \begin{bmatrix} {c_{k}^{j}\lbrack 0\rbrack} & \; & \; \\ \; & ⋰ & \; \\ \; & \; & {c_{k}^{j}\left\lbrack {B - 1} \right\rbrack} \end{bmatrix}$ and c_(k) ^(j)[i] is the k-th user's composite code vector for the j-th base-station used to spread and scramble the data symbol s_(k) ^(j)[i]: c _(k) ^(j) [i]=[c _(k) ^(j)[(i mod ρ)N] . . . c _(k) ^(j)[(i mod ρ)N+N−1]] It is important to note that the k-th user's composite code vector c_(k) ^(j)[i] can be written as the component-wise multiplication of the user specific short spreading code vector ć_(k) ^(j) and the base-station specific long scrambling code vector c_(s) ^(j)[i]: c _(k) ^(j) [i]=ć _(k) ^(j) {circle around (·)}c _(s) ^(j) [i]  (12) where ć_(k) ^(j)=[ć_(k) ^(j)[0] . . . ć_(k) ^(j)[N−1]] stacks the Walsh-Hadamard spreading code coefficients for the k-th user served by the j-th base-station and c_(s) ^(j)[i] is simlarly defined as c_(k) ^(j)[i]. The pilot composite code matrix C_(p) ^(j), the pilot composite code vector c_(p) ^(j)[i] and the pilot spreading code vector ć_(p) ^(j) of the j-th base-station are similarly defined as C_(k) ^(j), c_(k) ^(j)[i] and ć_(k) ^(j) respectively.

The vector x₀ ^(j) is a row of every input matrix from the set {X_(a) _(j) }_(a) _(j) _(=δ) _(j) _(−Q) ^(δ) _(j) ^(+L) _(j) and is therefore contained in every output matrix from the set {Y_(a) _(j) }_(a) _(j) _(=δ) _(j) _(−Q) ^(δ) _(j) ^(+L) _(j) . For this reason, x₀ ^(j) can be determined from the set of A^(j) output matrices {Y_(a) _(j) }_(a) _(j) _(=A) ₁ _(j) ^(A) ² ^(j) , with δ_(j)−Q≦A₁ ^(j)≦a_(j)≦A₂ ^(j)≦δ_(j)+L_(j) (A^(j)=A₂ ^(j)−A₁ ^(j)+1). The A^(j)(Q+1)M×A^(j)BN super output matrix y^(i) corresponding to the j-th base-station is then defined as follows:

$y^{i} = {\begin{bmatrix} Y_{A_{1}^{j}} & \; & \; \\ \; & ⋰ & \; \\ \; & \; & Y_{A_{2}^{j}} \end{bmatrix}.}$ The B×A^(j)BN super pilot composite code matrix C_(p) ^(j) corresponding to the j-th base-station is defined by: C_(p) ^(j)=[C_(p) ^(j) . . . C_(p) ^(j)] while the K_(j)B×A^(j)BN super multi-user composite code matrix C_(d) ^(j) corresponding to the j-th base-station is given by: C_(d) ^(j)=[C_(d) ^(j) . . . C_(d) ^(j)]

G.2.b Pilot-aided block RAKE receiver. The j-th branch of the pilot-aided block RAKE receiver estimates the desired user's data symbol vector s_(k) _(j) ^(j) (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from Y_(δ) _(j) , with Q=L_(j), based on the knowledge of the j-th base-station's composite code matrix for the desired user C_(k) _(j) ^(j), pilot composite code matrix C_(p) ^(j) and pilot symbol vector s_(p) ^(j).

The pilot-aided channel estimation part of the j-th branch exploits the knowledge of the pilot composite code matrix and the pilot symbol vector to determine the j-th base-station's space-time channel coefficients [22]. The columns of the (L_(j)+1)M×B matrix V^(j) contain the initial estimates of j-th base-station's spacetime channel vector at the different symbol instants. V^(j) is determined as follows: V ^(j)=(Y _(δ) _(j) C _(p) ^(j) ^(H) ){circle around (·)}S _(p) ^(j*)  (13) where the j-th base-station's (L_(j)+1)M×B pilot symbol matrix S_(p) ^(j) is defined as follows: S_(p) ^(j)=[s_(p) ^(j) ^(T) . . . s_(p) ^(j) ^(T) ]^(T) The (L_(j)+1)M×BN output matrix Y_(δ) _(j) is first despread with the j-th base-station's pilot composite code matrix C_(p) ^(j). The (L_(j)+1)M×B despread output matrix Y_(δ) _(j) C_(p) ^(j) ^(H) is then component-wise multiplied with the complex conjugate of the (L_(j)+1)M×B pilot symbol matrix S_(p) ^(j) to remove the effect of the pilot symbol modulation. The final estimate of the j-th base-station's space-time channel vector is the average of the different initial estimates:

$\begin{matrix} {v^{j} = {\frac{1}{B}V^{j}1_{B \times 1}}} & (14) \end{matrix}$

This final estimate of the space-time channel vector v^(j) may subsequently be used to extract the desired user's soft symbol decisions in the j-th branch: s _(k) _(j) ^(j)=v^(j) ^(H) Y_(δ) _(j) C_(k) _(j) ^(j) ^(H)   (15) by despreading the coherently combined output matrix v^(j) ^(H) Y_(δ) _(j) with the desired user's composite code matrix C_(k) _(j) ^(j).

The final soft symbol decisions for the desired user are then obtained by averaging the soft symbol decisions from each of the J branches:

$\begin{matrix} {{\overset{\_}{s}}_{du} = {\frac{1}{J}{\sum\limits_{j = 1}^{J}{\overset{\_}{s}}_{k_{j}}^{j}}}} & (16) \end{matrix}$ Finally, the soft decisions s _(du) are fed into a decision device that determines the nearest constellation point. This results in the hard decisions ŝ_(du).

G.2.c Fully-trained block chip-level equalizer receiver. The j-th branch of the fully-trained block chip-level equalizer receiver estimates the desired user's data symbol vector s_(k) _(j) ^(j) (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from Y_(a) _(j) , with δ_(j)−Q≦a_(j)≦δ_(j)+L_(j), based on the knowledge of the j-th base-station's composite code matrix for the desired user C_(k) _(j) ^(j) and multi-user chip vector x₀ ^(j). Although the fully-trained block chip-level equalizer receiver has zero spectral efficiency and is therefore useless in practice, its BER performance serves well as a lower bound on the BER probability of the proposed block chip equalizer receivers [14].

We assume first, for the sake of clarity, there is no additive noise present in Y_(a) _(j) (δ_(a) _(j) −Q≦a_(j)≦δ_(j)+L_(j)). Because of assumptions 5 and 6, the rows of Y_(a) _(j) span the row space of X_(a) _(j) . Hence, there exists a 1×(Q+1)M linear fully-trained chip-level equalizer w_(a) _(j) ^(j), for which: w _(a) _(j) ^(j) Y _(a) _(j) −x ₀ ^(j)=0  (17) where w_(a) ^(j) is a ZF linear chiplevel equalizer with (Q+1)M−r degrees of freedom.

In order to derive an identifiability condition on w_(a) _(j) ^(j), we observe that the ZF problem in Equation 17 can be recast into the following equivalent ZF problem:

${\left\lbrack {{\overset{\sim}{w}}_{a_{j}}^{j}❘1} \right\rbrack\left\lbrack \frac{X_{a_{j}}}{- x_{0}^{j}} \right\rbrack} = 0$ because the rows of X_(a) _(j) span the row space of Y_(a) _(j) . In order to guarantee the uniqueness of the solution for {tilde over (w)}_(a) _(j) ^(j), the right-hand side matrix of the above equation should have at most a one-dimensional left null space. This leads to the following identifiability condition: BN≧r  (18) which states that the number of observed chip samples should be larger than the system order r.

Let us now assume that additive noise is present in Y_(a) _(j) (δ_(j)−Q≦a_(j)≦δ_(j)+L_(j)). We then solve the following Least Squares (LS) minimisation problem:

$\begin{matrix} {{\overset{\sim}{w}}_{aj}^{j} = {\arg\mspace{14mu}{\min\limits_{w_{aj}^{j}}{{{w_{aj}^{j}Y_{aj}} - x_{0}^{j}}}^{2}}}} & (19) \end{matrix}$ which can be interpreted as follows. The equalized output matrix for the j-th base-station w_(a) _(j) ^(j)Y_(a) _(j) should be as close as possible to the known multi-user chip vector of the j-th base-station x₀ ^(j) in a Least Squares sense.

The obtained fully-trained chiplevel equalizer w _(a) ^(j) may subsequently be used to extract the desired user's soft symbol decisions in the j-th branch: s _(k) _(j) ^(j)= w _(a) _(j) ^(j)Y_(a) _(j) C_(k) _(j) ^(j) ^(H)   (20) The final soft symbol decisions for the desired user are then obtained by averaging the soft symbol decisions from each of the J branches:

$\begin{matrix} {{\overset{\_}{s}}_{du} = {\frac{1}{J}{\sum\limits_{j - 1}^{J}{\overset{\_}{s}}_{k_{j}}^{j}}}} & (21) \end{matrix}$ Finally, the soft decisions s _(du) are fed into a decision device that determines the nearest constellation point. This results in the hard decisions ŝ_(du).

G.2.d Pilot-trained block chip-level equalizer receiver. The j-th branch of the pilot-trained block chip-level equalizer receiver estimates the desired user's data symbol vector s_(k) _(j) ^(j) (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from Y_(a) _(j) , with δ_(j)−Q≦a_(j)≦δ_(j)+L_(j), based on the knowledge of the j-th base-station's composite code matrix for the desired user C_(k) _(j) ^(j), pilot composite code matrix C_(p) ^(j) and pilot symbol vector s_(p) ^(j).

We assume first, for the sake of clarity, there is no additive noise present in Y_(a) _(j) (δ_(j)−Q≦a_(j)≦δ_(j)+L_(j)). Because of assumptions 5 and 6, the rows of Y_(a) _(j) span the row space of X_(a) _(j) _(j) . Hence, there exists a 1×(Q+1)M linear pilot-trained chip-level equalizer g_(a) _(j) ^(j), for which: g _(a) _(j) ^(j) Y _(a) _(j) −x ₀ ^(j)=0 where g_(a) ^(j) is a ZF linear chip-level equalizer with (Q+1)M−r degrees of freedom. By despreading the above equation with the pilot composite code matrix C_(p) ^(j) and by using Equation 70 we can then write: g _(a) _(j) ^(j) Y _(a) _(j) C _(p) ^(j) ^(H) −s _(p) ^(j)=0  (22) because C_(d) ^(j)C_(p) ^(j) ^(H) =0_(k) _(j) _(k×K) due to the orthogonality of the user and pilot code sequences at each symbol instant and because C_(p) ^(j)C_(p) ^(j) ^(H) =I_(B) due to the normalisation of the pilot code sequence at each symbol instant.

In order to derive an identifiability condition on g_(a) _(j) ^(j), we observe that the ZF problem in Equation 22 can be recast into the following equivalent ZF problem:

${\left\lbrack {{\overset{\sim}{g}}_{a_{j}}^{j}❘1} \right\rbrack\left\lbrack \frac{X_{a_{j}}C_{p}^{j^{H}}}{- s_{p}^{j}} \right\rbrack} = 0$ because the rows of X_(a) _(j) span the row space of Y_(a) _(j) . In order to guarantee the uniqueness of the solution for {tilde over (g)}_(a) _(j) ^(j), the right-hand side matrix of the above equation should have at most a one-dimensional left null space. This leads to the following identifiability condition: B≧r  (23) which states that the block length B should be larger than the system order r.

Note that Equation 22 can be derived for all A^(j) processing delays a=A₁ ^(j), A₁ ^(j)+1, . . . , A₂ ^(j) and these results can be combined, leading to:

${{\frac{1}{A^{j}}g^{j}y^{j}C_{p}^{j^{H}}} - s_{p}^{j}} = 0$ where g^(j) is the 1×A^(j)(Q+1)M linear pilot-trained super chip-level equalizer for the j-th base-station: g^(j)=[g_(A) ₁ _(j) ^(j) . . . g_(A) ₂ _(j) ^(j)] For the pilot-trained receiver, choosing A^(j)>1 corresponds to taking a larger Q. For this reason, we will only focus on one processing delay (A^(j)=1).

Let us now assume that additive noise is present in Y_(a) _(j) (δ_(j)−Q≦a_(j)≦δ_(j)+L_(j)). We then solve the following Least Squares (LS) minimisation problem:

$\begin{matrix} {{\overset{\_}{g}}_{a_{j}}^{j} = {\arg\;{\underset{g_{a_{j}}^{j}}{\;\min}{{{g_{a_{j}}^{j}Y_{a_{j}}C_{p}^{j^{H}}} - s_{p}^{j}}}^{2}}}} & (24) \end{matrix}$ which can be interpreted as follows. The equalized output matrix for the j-th base-station g_(a) _(j) ^(j)Y_(a) _(j) is despread with the j-th base-station's pilot composite code matrix C_(p) ^(j). The equalized output matrix after despreading g_(a) _(j) ^(j)Y_(a) _(j) C_(p) ^(j) ^(H) should then be as close as possible to the known pilot symbol vector of the j-th base-station s_(p) ^(j) in a Least Squares sense.

It is easy to prove that the LS problem in Equation 24 can be rewritten as follows:

$\begin{matrix} {{\overset{\_}{g}}_{aj}^{j} = {\arg\mspace{14mu}{\min\limits_{g_{aj}^{j}}{{{g_{aj}^{j}{Y_{aj}\left( {C_{p}^{j^{H}}C_{p}^{j}} \right)}} - {s_{p}^{j}C_{p}^{j}}}}^{2}}}} & (25) \end{matrix}$ showing that the equalized output matrix g_(a) _(j) ^(j)Y_(a) _(j) is first projected on the subspace spanned by the j-th base-station's pilot composite code matrix C_(p) ^(j). The equalized output matrix after projecting g_(a) _(j) ^(j)Y_(a) _(j) (C_(p) ^(j) ^(H) C_(p) ^(j)) should then be as close as possible to the known pilot chip vector of the j-th base-station s_(p) ^(j)C_(p) ^(j) in a Least Squares sense.

The user-specific detection part of the pilot-trained block chip-level equalizer receiver is very similar to the one of the fully-trained block chip-level equalizer receiver. Equations 20 and 75 remain valid if we replace w _(a) _(j) ^(j) by g _(a) _(j) ^(j).

G.2.e Enhanced pilot-trained block chip-level equalizer receiver. The j-th branch of the enhanced pilot-trained block chip-level equalizer receiver estimates the desired user's data symbol vector s_(k) _(j) ^(j) from Y_(a) _(j) , with δ_(j)−Q≦a_(j)≦δ_(j)+L_(j), based on the knowledge of the j-th base-station's multi-user composite code matrix C_(d) ^(j), pilot composite code matrix C_(p) ^(j) and pilot symbol vector s_(p) ^(j).

We assume first, for the sake of clarity, there is no additive noise present in Y_(a) _(j) (δ_(j)−Q≦a_(j)≦δ_(j)+L_(j)). Because of assumptions 5 and 6, the rows of Y_(a) _(j) span the row space of X_(a) _(j) . Hence, there exists a 1×(Q+1)M linear enhanced pilot-trained chip-level equalizer f_(a) _(j) ^(j), for which: f _(a) _(j) ^(j) Y _(a) _(j) −x ₀ ^(j)=0 where f_(a) _(j) ^(j) is a ZF linear chip-level equalizer with (Q+1)M−r degrees of freedom (hence, this linear chip-level equalizer is only unique when (Q+1)M=r). Using Equation 70 we can then write: f _(a) _(j) ^(j) Y _(a) _(j) −s _(d) ^(j) C _(d) ^(j) −s _(p) ^(j) C _(p) ^(j)=0  (26) which is a ZF problem in both the equalizer vector f_(a) _(j) ^(j) and the multi-user data symbol vector s_(d) ^(j).

In order to derive an identifiability condition on f_(a) _(j) ^(j),s_(d) ^(j), we observe that the ZF problem in Equation 26 can be recast into the following equivalent ZF problem:

${\left\lbrack {{\overset{\sim}{f}}_{a_{j}}^{j}{s_{d}^{j}}1} \right\rbrack\begin{bmatrix} X_{a_{j}} \\ {- C_{d}^{j}} \\ {{- s_{p}^{j}}C_{p}^{j}} \end{bmatrix}} = 0$ because the rows of X_(a) _(j) span the row space of Y_(a) _(j) . In order to guarantee the uniqueness of the solution for {tilde over (f)}_(a) _(j) ^(j), s_(d) ^(j), the right-hand side matrix of the above equation should have at most a one-dimensional left null space. This leads to the following identifiability condition: B(N−K _(j))≧r  (27) which states that the block length B times the number of unused spreading codes N−K_(j) should be larger than the system order r. Therefore the maximum number of users that can be supported is K_(j)=N−1, where N is the spreading factor.

Note that Equation 26 can be derived for all A^(j) processing delays a=A₁ ^(j),A₁ ^(j)+1, . . . , A₂ ^(j) and these results can be combined, leading to: f ^(j) y ^(j) −s _(d) ^(j) C _(d) ^(j) −s _(p) ^(j) C _(p) ^(j)=0 where f^(j) is the 1×A^(j)(Q+1)M linear enhanced pilot-trained super chip-level equalizer for the j-th base-station: f^(j)=[f_(A) ₁ _(j) ^(j) . . . f_(A) ₂ _(j) ^(j)] For the enhanced pilot-trained receiver, taking A^(j)>1 does not correspond to taking a larger Q but to the mutually referenced equalizer (MRE) approach presented in [23]. However, the performance improvements obtained by taking A^(j)>1 come at a very high cost. For this reason, we will only focus on one processing delay (A^(j)=1).

Let us now assume that additive noise is present in Y_(a) _(j) (δ_(j)−Q≦a_(j)≦δ_(j)+L_(j)). We then solve the following Least Squares (LS) minimisation problem:

$\begin{matrix} {{\overset{\_}{f}}_{a_{j}}^{j},{{\overset{\_}{s}}_{d}^{j} = {\arg{\;\;}{\min\limits_{f_{a_{j}}^{j},s_{d}^{j}}{{{f_{a_{j}}^{j}Y_{a_{j}}} - {s_{d}^{j}C_{d}^{j}} - {s_{p}^{j}C_{p}^{j}}}}^{2}}}}} & (28) \end{matrix}$ Since the LS cost function is a quadratic form in f_(a) _(j) ^(j),s_(d) ^(j), the minimisation can be done independently for f_(a) _(j) ^(j) and s_(d) ^(j). In order to obtain a direct equalizer estimation, we first solve for s_(d) ^(j), assuming f_(a) _(j) ^(j) to be known and fixed. The LS solution for s_(d) ^(j) can be simplified to: s _(d) ^(j) =f _(a) _(j) ^(j) Y _(a) _(j) C _(d) ^(j) ^(H)   (29) because C_(d) ^(j)C_(d) ^(j) ^(H) =I_(K) _(j) _(B) and C_(p) ^(j)C_(d) ^(j) ^(H) =0_(B×K) _(j) _(B) due to the orthogonality of the user code sequences and the pilot code sequence at each symbol instant. Substituting s _(d) ^(j) into the original LS problem of Equation 28, leads to a modified LS problem in f_(a) _(j) ^(j) only:

$\begin{matrix} {{\overset{\_}{f}}_{aj}^{j} = {\arg\mspace{14mu}{\min\limits_{f_{aj}^{j}}{{{f_{aj}^{j}{Y_{aj}\left( {I_{BN} - {C_{d}^{j^{H}}C_{d}^{j}}} \right)}} - {s_{p}^{j}C_{p}^{j}}}}^{2}}}} & (30) \end{matrix}$ which can be interpreted as follows. The equalized output matrix for the j-th base-station f_(a) _(j) ^(j)Y_(a) _(j) is projected on the orthogonal complement of the subspace spanned by the j-th base-station's multi-user composite code matrix C_(d) ^(j). The equalized output matrix after projecting f_(a) _(j) ^(j)Y_(a) _(j) (I_(BN)−C_(d) ^(j) ^(H) C_(d) ^(j)) should then be as close as possible to the known pilot chip vector of the j-th base-station s_(p) ^(j)C_(p) ^(j) in a Least Squares sense. Note that when K_(j)=N−1, I_(BN)−C_(d) ^(j) ^(H) C_(d) ^(j)=C_(p) ^(j) ^(h) C_(p) ^(j) and Equation 30 reduces to Equation 25. This means that for a fully loaded system (K_(j)=N−1) the enhanced pilot-trained approach is exactly the same as the pilot-trained approach. This is also indicated by the identifiability conditions 23 and 27.

It is easy to prove that the modified LS problem of Equation 30 can be rewritten as follows:

$\begin{matrix} {{\overset{\_}{f}}_{aj}^{j} = {\arg\mspace{14mu}{\min\limits_{f_{aj}^{j}}\left\{ {{{{f_{aj}^{j}Y_{aj}C_{p}^{j^{H}}} - s_{p}^{j}}}^{2} + {{f_{aj}^{j}{Y_{aj}\left( {I_{BN} - {C_{d}^{j^{H}}C_{d}^{j}} - {C_{p}^{j^{H}}C_{p}^{j}}} \right)}}}^{2}} \right\}}}} & (31) \end{matrix}$ showing that the enhanced pilot-trained LS problem naturally decouples into two different parts: a training-based part and a fully-blind part. On the one hand, the training-based part (described by the first term of Equation 31) corresponds to the pilot-trained chip equalizer of Equation 24. On the other hand, the fully-blind part (described by the second term of Equation 31) projects the equalized output matrix f_(a) _(j) ^(j)Y_(a) _(j) on the orthogonal complement of the subspace spanned by both the j-th base-station's multi-user composite code matrix C_(d) ^(j) and the j-th base-station's pilot composite code matrix C_(p) ^(j). Furthermore, the energy in the equalized output matrix after projecting f_(a) _(j) ^(j)Y_(a) _(j) (I_(BN)−C_(d) ^(j) ^(H) C_(d) ^(j)−C_(p) ^(j) ^(H) C_(p) ^(j)) should be as small as possible in a Least Squares sense which actually corresponds to the Minimum Output Energy (MOE) criterion of [18] and [19]. For this reason, the enhanced pilot-trained chip equalizer is actually a semi-blind chip-level equalizer.

The user-specific detection part of the enhanced pilot-trained block chip equalizer receiver is very similar to the one of the fully-trained block chip equalizer receiver. Equations 20 and 75 remain valid if we replace w _(a) _(j) ^(j) by f _(a) _(j) ^(j).

G.3 Space-Time Adaptive Chip-Level Equalizer Receivers

Up till now, we have only discussed the block version of the different chip equalizer receivers. It addresses a block of B symbols at once and is only suited for a block fading channel, that remains constant during the entire duration of the block. Since in practice, the multi-path fading channel is time-varying, we have to devise an adaptive version as well that updates the equalizer coefficients on the fly. In this section, we derive an adaptive version of the different chip-level equalizer receivers from their corresponding block version and provide complexity figures for each of them. The adaptive receivers address a single symbol at once and consist of J parallel branches, one for each base-station connected to the mobile station of interest. The j-th branch basically consists of two parts: a pilot-aided updating part and a user specific detection part.

G.3.a Preliminary definitions. We introduce the following (Q+1)M×N output matrix block Y_(a)[i];

$\begin{matrix} {{Y_{a}\lbrack i\rbrack} = \begin{bmatrix} {y\left\lbrack {{i\; N} + a} \right\rbrack} & \cdots & {y\left\lbrack {{\left( {i + 1} \right)N} + a - 1} \right\rbrack} \\ {\vdots\;} & \; & \vdots \\ {y\left\lbrack {{iN} + a + Q} \right\rbrack} & \cdots & {y\left\lbrack {{\left( {i + 1} \right)N} + a + Q - 1} \right\rbrack} \end{bmatrix}} & (32) \end{matrix}$ where i represents the symbol instant and a the processing delay.

The (L_(j)+1)M×1 pilot symbol vector for the j-th base-station at the i-th symbol instant is defined as follows: s_(p) ^(j)[i]=[s_(p) ^(j)[i] . . . s_(p) ^(j)[i]]^(T)

The 1×N multi-user chip vector transmitted by the j-th base-station at the i-th symbol instant starting at delay a is defined by: x _(a) ^(j) [i]=[x ^(j) [iN+a] . . . x ^(j)[(i+1)N+a−1]]

The K_(j)×N multi-user composite code matrix C_(d) ^(j)[i] stacks the j-th base-station's active user composite code vectors at the i-th symbol instant: C_(d) ^(j)[i]=[c₁ ^(j)[i]^(T) . . . c_(K) _(j) ^(j)[i]^(T)]^(T) Note that the K_(j)×N multi-user spreading code matrix Ć_(d) ^(j) can be similarly defined as C_(d) ^(j)[i], stacking the j-th base-station's active user spreading code vectors for each symbol instant.

G.3.b Pilot-aided adaptive RAKE receiver. The j-th branch of the pilot-aided adaptive RAKE receiver estimates the desired user's data symbol s_(k) _(j) ^(j)[i] (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from the set Y_(δ) _(j) [0], Y_(δ) _(j) [1], . . . , Y_(δ) _(j) [i], with Q=L_(j), based on the knowledge of the j-th base-station's composite code vectors for the desired user c_(k) _(j) ^(j)[0],c_(k) _(j) ^(j)[1], . . . ,c_(k) _(j) ^(j)[i] pilot composite code vectors c_(p) ^(j)[0],c_(p) ^(j)[1], . . . ,c_(p) ^(j)[i] and pilot symbols s_(p) ^(j)[0],s_(p) ^(j)[1], . . . ,s_(p) ^(j)[i].

The pilot-aided channel updating part of the j-th branch exploits the knowledge of the pilot composite code vectors and pilot symbols of the corresponding j-th base-station [22]. It continuously updates the j-th base-station's space-time channel vector at the symbol rate. By having a closer look at the corresponding block processing algorithm in Equation 13, it is rather straightforward to derive an adaptive processing algorithm for the j-th branch. The new estimate of the space-time channel vector is the weighted sum of the old estimate and some correction term: v ^(j) [i]=(1−α)v ^(j) [i−1]+α(Y _(δ) _(j) [i]c _(p) ^(j) [i] ^(H)){circle around (·)}s _(p) ^(j) [i]*  (33) The correction term is the component-wise multiplication of the new despread output matrix block Y_(δ) _(j) [i]c_(p) ^(j)[i]^(H) and the complex conjugate of the new pilot symbol vector s_(p) ^(j)[i]. The pilot-aided channel updating part for the j-th branch consists of three computational steps per time update, namely first a pilot despreading step, then a pilot symbol modulation removal step and finally an updating step. The pilot despreading step requires O(L_(j)MN) computations. The pilot symbol modulation removal step as well as the updating step require O(L_(j)M) computations. So, the pilot-aided channel updating part requires

$\left( {\sum\limits_{j = 1}^{J}{L_{j}{MN}}} \right)$ computations in total for all J branches. By setting L=max_(j) L_(j), we can upperbound this complexity figure by O(JLMN).

The user-specific detection part of the j-th branch utilizes the updated space-time channel vector, provided by the channel updating part, to coherently combine the received output matrix block Y_(δ) _(j) [i]. It subsequently generates a soft decision s _(k) _(j) ^(j)[i] about the desired user's i-th symbol: s _(k) _(j) ^(j)[i]=v^(j)[i]^(H)Y_(δ) _(j) [i]c_(k) _(j) ^(j)[i]^(H)  (34) by despreading the coherently combined output matrix block v^(j)[i]^(H)Y_(δ) _(j) [i] with the desired user's composite code vector at the i-th symbol instant c_(k) _(j) ^(j)[i]. The user-specific detection part consists of two computational steps per time update, namely first a coherent combining step and then a correlation step. On the one hand, the coherent combining step requires O(NL_(j)M) computations. On the other hand, the correlation step requires O(N) computations. So, the user-specific detection part requires

$\left( {\sum\limits_{j = 1}^{J}{{NL}_{j}M}} \right)$ computations in total for all J branches. With L=max_(j) L_(j), we finally arrive at O(JNLM) computations.

The final soft decision s _(du)[i] about the desired user's i-th symbol is then obtained with O(J) complexity by averaging the soft decisions of the different branches:

$\begin{matrix} {{{\overset{\_}{s}}_{du}\lbrack i\rbrack} = {\frac{1}{J}{\sum\limits_{j = 1}^{J}{{\overset{\_}{s}}_{k_{j}}^{j}\lbrack i\rbrack}}}} & (35) \end{matrix}$ Finally, the soft decision s _(du)[i] is fed into a decision device, that determines the nearest constellation point. This results in the hard decision ŝ_(du)[i].

G.3.c Fully-trained adaptive chip-level equalizer receiver. The j-th branch of the fully-trained adaptive chip-level equalizer receiver estimates the desired user's data symbol s_(k) _(j) ^(j)[i] (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from the set Y_(a) _(j) [0], Y_(a) _(j) [1], . . . , Y_(a) _(j) [i], with δ_(j)−Q≦a_(j)≦δ_(j)+L_(j), based on the knowledge of the j-th base-station's composite code vectors for the desired user c_(k) _(j) ^(j)[0],c_(k) _(j) ^(j)[1], . . . ,c_(k) _(j) ^(j)[i] and multi-user chip vectors x₀ ^(j)[0], x₀ ^(j)[1], . . . , x₀ ^(j)[i]. Although the fully-trained adaptive chip-level equalizer receiver is useless in practice, its BER performance serves well as a lower bound on the BER probability of the proposed adaptive chip-level equalizer receivers [14].

The fully-trained updating part of the j-th branch exploits the knowledge of the multi-user chip vectors of the corresponding j-th base-station. It continously updates the equalizer coefficients at the symbol rate using a Square Root Information (SRI) RLS type of adaptive algorithm [24]. By having a closer look at the corresponding LS block processing algorithm in Equation 19, it is indeed rather straightforward to derive an RLS type of adaptive processing algorithm for the j-th branch. The new incoming output matrix block Y_(a) _(j) [i] is used together with the new multi-user chip sequence vector x₀ ^(j)[i] in a QRD-updating step:

$\begin{matrix} \left. \begin{bmatrix} {R_{a_{j}}^{j}\lbrack i\rbrack} & 0_{{({Q + 1})}M \times 1} & 0_{{({Q\; + 1})}M \times {({N - 1})}} \\ {z_{a_{j}}^{j}\lbrack i\rbrack} & \bigstar & 0_{1 \times {({N - 1})}} \end{bmatrix}\leftarrow{\quad{\begin{bmatrix} {\lambda\;{R_{a_{j}}^{j}\left\lbrack {i - 1} \right\rbrack}} & {Y_{a_{j}}\lbrack i\rbrack} \\ {\lambda\;{z_{a_{j}}^{j}\left\lbrack {i - 1} \right\rbrack}} & {x_{0}^{j}\lbrack i\rbrack} \end{bmatrix} \cdot {T_{a_{j}}^{j}\lbrack i\rbrack}}} \right. & (36) \end{matrix}$ In this equation, λ is the forget factor, that should be chosen in correspondence with the coherence time of the time-varying channel. T_(a) _(j) ^(j)[i] represents the orthogonal updating matrix at the i-th symbol instant that can be written as a sequence of Givens transformations [24]. The fully-trained QRD-updating step tracks a (Q+1)M×(Q+1)M lower-triangular factor R_(a) _(j) ^(j)[i] and a corresponding 1×(Q+1)M right-hand side z_(a) _(j) ^(j)[i]. The new values for the fully-trained chip equalizer coefficients of the j-th branch W_(a) _(j) ^(j)[i] then follow from the backsubstitution step: w _(a) _(j) ^(j) [i]·R _(a) _(j) ^(j) [i]=z _(a) _(j) ^(j) [i]  (37) The fully-trained SRI-RLS updating part for the j-th branch consists of two computational steps per time update, namely first a triangular QRD-updating step and then a triangular backsubstitution step. On the one hand, the QRD-updating step amounts to triangularizing a structured matrix, namely a triangular matrix plus N extra columns, and requires O(NQ²M²) computations. On the other hand, the triangular backsubstitution step requires O(Q²M²) computations. So, the fully-trained updating part requires O(JNQ²M²) computations in total for all J branches.

The user-specific detection part of the j-th branch utilizes the updated chip equalizer coefficients w_(a) _(j) ^(j)[i], provided by the updating part, to equalize the received output matrix block Y_(a) _(j) [i]. It subsequently generates a soft decision s _(k) _(j) ^(j)[i] about the desired user's i-th symbol: s _(k) _(j) ^(j)[i]=w_(a) _(j) ^(j)[i]Y_(a) _(j) [i]c_(k) _(j) ^(j)[i]^(H)  (38) by despreading the equalized output matrix block w_(a) _(j) ^(j)[i]Y_(a) _(j) [i] with the desired user's composite code vector at the i-th symbol instant c_(k) _(j) ^(j)[i]. The user-specific detection part also consists of two computational steps per time update, namely first an equalization step and then a correlation step. On the one hand, the equalization step requires O(NQM) computations. On the other hand, the correlation step requires O(N) computations. So, the user-specific detection part requires O(JNQM) computations in total for all J branches.

The final soft decision s _(du)[i] about the desired user's i-th symbol is then obtained with O(J) complexity by averaging the soft decisions of the different branches:

$\begin{matrix} {{{\overset{\_}{s}}_{du}\lbrack i\rbrack} = {\frac{1}{J}{\sum\limits_{j = 1}^{J}{{\overset{\_}{s}}_{k_{j}}^{j}\lbrack i\rbrack}}}} & (39) \end{matrix}$ Finally, the soft decision s _(du) [i] is fed into a decision device, that determines the nearest constellation point. This results in the hard decision ŝ_(du)[i].

G.3.d Pilot-trained adaptive chip-level equalizer receiver. The j-th branch of the pilot-trained adaptive chip-level equalizer receiver estimates the desired user's data symbol s_(k) _(j) ^(j)[i] (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from the set Y_(a) _(j) [0], Y_(a) _(j) [1], . . . , Y_(a) _(j) [i], with δ_(j)−Q≦a_(j)≦δ_(j)+L_(j), based on the knowledge of the j-th base-station's composite code vectors for the desired user c_(k) _(j) ^(j)[0],c_(k) _(j) ^(j)[1], . . . ,c_(k) _(j) ^(j)[i], pilot composite code vectors c_(p) ^(j)[0],c_(p) ^(j)[1], . . . ,c_(p) ^(j)[i] and pilot symbols s_(p) ^(j)[0],s_(p) ^(j) [1], . . . ,s_(p) ^(y)[i].

The pilot-trained updating part of the j-th branch exploits the knowledge of the pilot composite code vectors and pilot symbols of the corresponding j-th base-station [25]. It continously updates the equalizer coefficients at the symbol rate using a Square Root Information (SRI) RLS type of adaptive algorithm [24]. By having a closer look at the corresponding LS block processing algorithm in Equation 24, it is indeed rather straightforward to derive an RLS type of adaptive processing algorithm for the j-th branch. The new incoming output matrix block Y_(a) _(j) [i] is first despread with the j-th base-station's pilot composite code vector for the i-th symbol instant c_(p) ^(j)[i]. The new despread output matrix block Y_(a) _(j) [i]c_(p) ^(j)[i]^(H) is then used together with the new pilot symbol s_(p) ^(j)[i] in a QRD-updating step:

$\begin{matrix} \left. \begin{bmatrix} {R_{a_{j}}^{j}\lbrack i\rbrack} & 0_{{({Q + 1})}M \times 1} \\ {z_{a_{j}}^{j}\lbrack i\rbrack} & \bigstar \end{bmatrix}\leftarrow{\quad{\begin{bmatrix} {\lambda\;{R_{a_{j}}^{j}\left\lbrack {i - 1} \right\rbrack}} & {{Y_{a_{j}}\lbrack i\rbrack}{c_{p}^{j}\lbrack i\rbrack}^{H}} \\ {\lambda\;{z_{a_{j}}^{j}\left\lbrack {i - 1} \right\rbrack}} & {s_{p}^{j}\lbrack i\rbrack} \end{bmatrix} \cdot {T_{a_{j}}^{j}\lbrack i\rbrack}}} \right. & (40) \end{matrix}$ In this equation, λ is again the forget factor and T_(a) _(j) ^(j)[i] the orthogonal updating matrix. The pilot-trained QRD-updating step tracks a (Q+1)M×(Q+1)M lower-triangular factor R_(a) _(j) ^(j)[i] and a corresponding 1×(Q+1)M right-hand side z_(a) _(j) ^(j)[i]. The new-values for the pilot-trained chip-level equalizer coefficients of the j-th branch g_(a) _(j) ^(j)[i] then follow from the backsubstitution step: g _(a) _(j) ^(j) [i]·R _(a) _(j) ^(j) [i]=z _(a) _(j) ^(j) [i]  (41) The pilot-trained SRI-RLS updating part for the j-th branch consists of three computational steps per time update, namely first a pilot despreading step, then a triangular QRD-updating step and finally a triangular backsubstitution step. The pilot despreading step requires O(QMN) computations. The QRD-updating step amounts to triangularizing a structured matrix, namely a triangular matrix with one extra column, and requires O(Q²M²) computations. The triangular backsubstitution step also requires O(Q²M²) computations. So, the pilot-trained updating part requires O(JQMN+JQ²M²) computations in total for all J branches.

The user-specific detection part of the pilot-trained adaptive chip-level equalizer receiver is very similar to the one of the fully-trained adaptive chip-level equalizer receiver. Equations 38 and 39 remain valid if we replace w_(a) _(j) ^(j)[i] by g_(a) _(j) ^(j)[i].

G.3.e Enhanced pilot-trained adaptive chip-level equalizer receiver. The j-th branch of the enhanced pilot-trained adaptive chip-level equalizer receiver estimates the desired user's data symbol s_(k) _(j) ^(j)[i] (we assume the desired user to be the k_(j)-th user of the j-th base-station multiplex) from the set Y_(a) _(j) [0], Y_(a) _(j) [1], . . . , Y_(a) _(j) [i], with δ_(j)−Q≦a_(j)≦δ_(j)+L_(j), based on the knowledge of the j-th base-station's multi-user composite code matrices C_(d) ^(j)[0],C_(d) ^(j)[1], . . . ,C_(d) ^(j)[i], pilot composite code vectors c_(p) ^(j)[0],c_(p) ^(j)[1], . . . ,c_(p) ^(j)[i] and pilot symbols s_(p) ^(j)[0],s_(p) ^(j)[1], . . . ,s_(p) ^(u)[i].

The enhanced pilot-trained updating part of the j-th branch exploits the knowledge of all composite code vectors (so both pilot and user composite code vectors) and pilot symbols of the corresponding j-th base-station. It continously updates the equalizer coefficients at the symbol rate using a Square Root Information (SRI) RLS type of adaptive algorithm [24]. By having a closer look at the corresponding LS block processing algorithm in Equation 30, it is indeed rather straightforward to derive an RLS type of adaptive processing algorithm for the j-th branch. Each new incoming output matrix block Y_(a) _(j) [i] is first projected on the orthogonal complement of the subspace spanned by the j-th base-station's multi-user composite code matrix C_(d) ^(j)d[i], achieved by the projection matrix P_(d) ^(j)[i]: P _(d) ^(j) [i]=I _(N) −C _(d) ^(j) [i] ^(H) C _(d) ^(j) [i]  (42) The new projected output matrix block Y_(a) _(j) [i]P_(d) ^(j)[i] is then used together with the new pilot chip vector s_(p) ^(j[i]c) _(p) ^(j)[i] in a QRD-updating step:

$\begin{matrix} \left. \begin{bmatrix} {R_{a_{j}}^{j}\lbrack i\rbrack} & 0_{{({Q + 1})}M \times 1} & 0_{{({Q\; + 1})}M \times {({N - 1})}} \\ {z_{a_{j}}^{j}\lbrack i\rbrack} & \bigstar & 0_{1 \times {({N - 1})}} \end{bmatrix}\leftarrow{\quad{\begin{bmatrix} {\lambda\;{R_{a_{j}}^{j}\left\lbrack {i - 1} \right\rbrack}} & {{Y_{a_{j}}\lbrack i\rbrack}{P_{d}^{j}\lbrack i\rbrack}} \\ {\lambda\;{z_{a_{j}}^{j}\left\lbrack {i - 1} \right\rbrack}} & {{s_{p}^{j}\lbrack i\rbrack}{c_{p}^{j}\lbrack i\rbrack}} \end{bmatrix} \cdot {T_{a_{j}}^{j}\lbrack i\rbrack}}} \right. & (43) \end{matrix}$ In this equation λ is again the forget factor and T_(a) _(j) ^(j)[i] the orthogonal updating matrix. The enhanced pilot-trained QRD-updating step tracks a (Q+1)M×(Q+1)M lower-triangular factor R_(a) _(j) ^(j)[i] and a corresponding 1×(Q+1)M right-hand side z_(a) _(j) ^(j)[i]. The new values for the enhanced pilot-trained chip-level equalizer coefficients of the j-th branch f_(a) _(j) ^(j)[i] then follow from the backsubstitution step: f _(a) _(j) ^(j) [i]·R _(a) _(j) ^(j) [i]=z _(a) _(j) ^(j) [i]  (44) The enhanced pilot-trained SRI-RLS updating part for the j-th branch consists of four computational steps per time update, namely first a projection step, then a pilot symbol spreading step, next a triangular QRD-updating step and finally a triangular backsubstitution step. The projection step decomposes into two substeps, namely the projection matrix calculation substep and the projection matrix multiplication substep. The projection matrix calculation substep would normally require O(K_(j)N²) computations. Note however that the multi-user composite code correlation matrix C_(d) ^(j)[i]^(H)C_(d) ^(j)[i] in Equation 42 can be expressed as follows: C _(d) ^(j) [i] ^(H) C _(d) ^(j) [i]=Ć _(d) ^(jH) Ć _(d) ^(j) {circle around (·)}c _(s) ^(j) [i] ^(H) c _(s) ^(j) [i]  (45) showing that it is the component-wise multiplication of the multi-user spreading code correlation matrix Ć_(d) ^(jH)Ć_(d) ^(j) and the scrambling code correlation matrix c_(s) ^(j)[i]^(H)c_(s) ^(j)[i]. Since the multi-user spreading code correlation matrix does not depend on the symbol instant i, it can be precalculated at the base-station and broadcasted to all active mobile stations. This leaves only O(N²) complexity for the projection matrix calculation substep at the mobile station. The projection matrix multiplication substep requires O(QMN²) computations. The pilot symbol spreading step only requires O(N) computations. The QRD-updating step amounts to triangularizing a structured matrix, namely a triangular matrix plus N extra columns, and requires O(NQ²M²) computations. Finally, the triangular backsubstitution step requires O(Q²M²) computations. So, the enhanced pilot-trained updating part requires O(JQMN²+JNQ²M²) computations in total for all J branches.

The user-specific detection part of the enhanced pilot-trained adaptive chip equalizer receiver is very similar to the one of the fully-trained adaptive chip equalizer receiver. Equations 38 and 39 remain valid if we replace w_(a) _(j) ^(j)[i] by f_(a) _(j) ^(j)[i].

G.3.f Complexity comparison. Table I compares the complexity of the different adaptive chip-level equalizer receivers. The user-specific detection part is common to all receivers. Moreover, all receivers have linear complexity in the number of actively tracked base-stations J. The updating part of the pilot-aided adaptive RAKE receiver (PA-RAKE) has linear complexity in both the total number of RAKE fingers LM and the spreading factor N. The updating part of all other receivers has quadratic complexity in the total number of equalizer coefficients QM. The updating part of both the fully-trained (FT-CE) and the pilot-trained (PT-CE) adaptive chip-level equalizer receiver has linear complexity whereas that of the enhanced pilot-trained (EPT-CE) adaptive chip-level equalizer receiver has quadratic complexity in the spreading factor N. In fact, the updating part of the EPT-CE is N times more complex than that of the PT-CE. Note that the complexity figure of the EPT-CE agrees with the complexity figure of the LS channel estimators for long code DS-CDMA systems recently proposed in [26].

C.4 Simulation Results

G.4.a Block processing. The simulations for the block chiplevel equalizer receivers are performed for the downlink of a WCDMA system with J=2 active base-stations, K_(j)=K users per base-station, QPSK data modulation, real orthogonal Walsh-Hadamard spreading codes of length N=12 along with a random overlay code for scrambling whose period measures

TABLE I COMPLEXITY OF ADAPTIVE CHIP-LEVEL EQUALIZER RECEIVERS updating detection PA-RAKE O (JLMN) O (JNLM) FT-CE O (JNQ²M²) O (JNQM) PT-CE O (JQMN + JQ²M²) O (JNQM) EPT-CE O (JQMN² + JNQ²M²) O (JNQM)

Complexity of Adaptive Chip-Level Equalizer Receivers

ρ=B symbols. The j-th base-station's interfering user signals and pilot signal are transmitted with a P_(i) ^(j)=P_(i) dB respectively P_(p) ^(j)=P_(p) dB higher power than the j-th base-station's desired user signal. The mobile station is equipped with the minimum number of receive antennas for chip rate sampling M=J+1=3 to simultaneously track J=2 base-stations. Each base-station's vector channel with order L_(j)=L=3 has M(L+1)=12 Rayleigh distributed channel taps of equal average power and is assumed to be constant during the entire duration of the block (block fading channel). The temporal smoothing factor Q+1 is chosen in correspondence with Equation 9 to be Q+1=JL=6. Note that the system order in this case equals r=J(J+1)L=18. All figures show the average BER versus the average SNR per bit for the pilot-aided block RAKE receiver (PA-RAKE), the pilot-trained (PT-CE), the enhanced pilot-trained (EPT-CE), and the fully-trained (FT-CE) block chip-level equalizer receiver. Also shown in the figures is the single-user bound (SUB) which is the theoretical BER-curve of QPSK with M(L+1)-th order diversity in Rayleigh fading channels [27].

FIG. 10 and 11 compare the performance of the space-time block chip-level equalizer receivers for half system load (K=5) and a block length of B=30 respectively B=60. The relative interference power is P_(i)=0 dB and the relative pilot power is P_(i)=0 dB. For a block length of B=30, shown in FIG. 10, both the EPT-CE and the FT-GE outperform the PA-RAKE for low to high SNR per bit. The PT-CE only outperforms the PA-RAKE at rather high SNR per bit. The EPT-CE achieves a 5.8 dB gain at a BER of 10⁻³ compared to the PT-CE and only exhibits a 1.7 dB loss compared to the ideal FT-CE. For a block length of B=60, this performance gain respectively loss decreases to 2.1 dB respectively 0.8 dB. For large block lengths B→∞, the performance of both the PT-CE and the EPT-CE converges to that of the ideal FT-CE.

FIGS. 12 and 13 compare the performance of the space-time block chip-level equalizer receivers for half system load (K=5) and a relative interference power of P_(i)=−10 dB respectively P_(i)=+10 dB. The block length is B=60 and the relative pilot power is P_(p)=0 dB. For a relative interference power of P_(i)=−10 dB, shown in FIG. 12, the FT-CE approaches the SUB rather closely. For low SNR per bit, the EPT-CE and the PA-RAKE have similar performance. Only at medium to high SNR per bit

$\left( {\frac{E_{b}}{N_{o}} \geq {5\mspace{14mu}{dB}}} \right),$ the EPT-CE outperforms the PA-RAKE. The PT-CE has worse performance than the PA-RAKE. Only at rather high SNR per bit

$\left( {\frac{E_{b}}{N_{o}} \geq {15\mspace{14mu}{dB}}} \right)$ the PT-CE has better performance than the PA-RAKE. For a relative interference power of P_(i)=+10 dB, shown in FIG. 13, the PA-RAKE exhibits an error floor at a BER of 2·10⁻¹. The curves for the PT-CE, the EPT-CE and the FT-CE shift up due to the increased MUI but remain near/far resistant.

FIG. 14 compares the performance of the space-time block chip-level equalizer receivers for full system load (K=11), a block length of B=60, a relative interference power of P_(i)=0 dB and a relative pilot power of P_(p)=0 dB. The PT-CE and the EPT-CE have exactly the same performance (as discussed in Subsection G.2.e) but still outperform the PA-RAKE receiver, that exhibits an error floor at a BER of 5·10⁻². Compared to the ideal FT-CE, the PT-CE and the EPT-CE exhibit a 3.3 dB loss at a BER of 10⁻³.

FIG. 15 compares the performance of the space-time block chip-level equalizer receivers for half system load (K=5), an exponential power delay profile with a decay factor of 3 dB, a block length of B=60, a relative interference power of P_(i)=0 dB and a relative pilot power of P_(p)=0 dB. Compared to the curves in FIG. 11, the different CE receivers exhibit a marginal performance loss.

G.4.b Adaptive processing. The simulations for the adaptive chip-level equalizer receivers are performed for the downlink of a WCDMA system with J=1 active base-station, K_(j)=K equal power users, QPSK data modulation, real orthogonal Walsh-Hadamard spreading codes of length N=12 along with a random overlay code for scrambling whose period measures ρ=10 symbols. The mobile station is equipped with the minimum number of receive antennas for chip rate sampling M=J+1=2 to simultaneously track J=1 base-station. The base-station's time-varying vector channel with order L_(j)=L=3 has M(L+1)=8 Rayleigh distributed channel taps of equal average power with the classical Jakes spectrum and a normalized coherence time of τ_(c)=2000 symbols. FIGS. 16 and 17 compare the performance of the pilot-aided adaptive RAKE receiver (PA-RAKE) with weighting factor

${\alpha = \frac{1}{2}},$ the pilot-trained (PT-CE), the enhanced pilot-trained (EPT-CE) and the fully-trained (FT-CE) adaptive chip-level equalizer receiver for half respectively full system load. The temporal smoothing factor Q+1 is chosen in correspondence with Equation 9 to be Q+1=JL=3. The optimal value of the RLS forget factor proved to be λ=0.95, corresponding to a data memory

$\frac{1}{1 - \lambda}\mspace{20mu}{being}\mspace{20mu}\frac{1}{100}$ of the normalized coherence time τ_(c). Also shown in the figures is the single-user bound (SUB) which is the theoretical BER-curve of QPSK with M(L+1)-th order diversity in Rayleigh fading channels [27].

For half system load, shown in FIG. 16, the EPT-CE outperforms the PT-CE by 2.9 dB and suffers a 1.9 dB loss compared to the FT-CE at a BER of 10⁻³. The PA-RAKE exhibits an error floor at a BER of 10⁻².

For full system load, shown in FIG. 17, the PT-CE and the EPT-CE have exactly the same performance and suffer a 5 dB loss compared to the FT-CE at a BER of 10⁻¹. The PA-RAKE exhibits an error floor at a BER of 4·10⁻².

G.5 Conclusion

In this paper, we proposed new pilot-trained and enhanced pilot-trained space-time chiplevel equalizer receivers for the downlink of WCDMA systems with a continuous code-multiplexed pilot. With MS_(t)S_(p)=J+1 diversity channels at the mobile station, the proposed receivers can simultaneously track J active base-station signals in soft handover mode. These diversity channels can be obtained by either M-fold space diversity, S_(t)-fold temporal oversampling, S_(p)-fold polarization diversity or a combination of the three above mentioned techniques. For instance, with a temporal oversampling factor of S_(t)=2 and S_(p)=2-fold polarization diversity at the receiver, we obtain MS_(t)S_(p)=4 (good for soft handover between J=3 base-stations) with only M=1 receive antenna at the mobile station. For both receivers a Least Squares algorithm for block processing and a QRD-based Recursive Least Squares algorithm for adaptive processing has been derived. The proposed receivers are compared with the conventional pilot-aided space-time RAKE receiver and the ideal fully-trained space-time chip-level equalizer receiver both in terms of performance and complexity.

For full system load, the pilot-trained and the enhanced pilot-trained receiver have exactly the same performance. However, for low to medium system load, the enhanced pilot-trained receiver outperforms the pilot-trained receiver and its performance comes close to that of the ideal fully-trained receiver. Moreover, the enhanced pilot-trained chip-level equalizer can be viewed as a semi-blind chip equalizer, since its cost function can be written as the sum of a training-based term and a fully-blind term.

The Least Squares block processing algorithms on the one hand, detect blocks of B symbols at once. For large block lengths B→∞, the performance of both the pilot-trained and the enhanced pilot-trained block chip equalizer receiver converges to the performance of the fully-trained block chip equalizer receiver. Conversely, the performance gain of the enhanced pilot-trained block chip-level equalizer receiver compared to the pilot-trained block chip-level equalizer receiver increases for decreasing block lengths B. Both receivers can deal with a severe near/far situation caused by the power control in the downlink. Moreover they are robust against both channel order over- and underestimation and they can cope with channels with a head or tail approaching zero.

The QRD-based Recursive Least Squares adaptive processing algorithms on the other hand, that detect symbol by symbol, are easily derived from their corresponding block processing algorithm. Both the pilot-trained and the enhanced pilot-trained adaptive chip-level equalizer receiver can track time-varying multi-path channels. The pilot-trained adaptive chip-level equalizer receiver outperforms the pilot-aided adaptive RAKE receiver at medium to high SNR per bit while the enhanced pilot-trained adaptive chip-level equalizer receiver always outperforms the pilot-aided adaptive RAKE receiver. Both the pilot-trained and the enhanced pilot-trained chip-level equalizer receiver have linear complexity in the number of tracked base-stations J and quadratic complexity in the total number of equalizer coefficients QMS_(t)S_(p). Whereas the pilot-trained receiver has only linear complexity in the spreading factor N, the enhanced pilot-trained receiver has quadratic complexity in the spreading factor N. In fact, the enhanced pilot-trained adaptive chip-level equalizer receiver exhibits a N times higher complexity than the regular pilot-trained adaptive chip-level equalizer receiver.

We can conclude that the enhanced pilot-trained spacetime chip-level equalizer receiver is a promising receiver for future WCDMA terminals, from a performance as wel as a complexity point of view.

H. Embodiment

H.1 WCDMA Forward Link Data Model

H.1.a Multi-channel framework. Let us consider the forward link of a single-cell WCDMA system with K active user terminals. The base-station transmits a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific a periodic scrambling code. The transmitted multi-user chip sequence consists of K user signals and a continuous pilot signal:

$\begin{matrix} {{x\lbrack n\rbrack} = {{\sum\limits_{k = 1}^{K}{{s_{k}\lbrack i\rbrack}{c_{k}\lbrack n\rbrack}}} + {{s_{p}\lbrack i\rbrack}{c_{p}\lbrack n\rbrack}}}} & (46) \end{matrix}$ with

$i = {\left\lfloor \frac{n}{N} \right\rfloor.}$ Each user's data symbol sequence s_(k)[i] (pilot symbol sequence s_(p)[i]) is spread by a factor N with the user code sequence c_(k)[n] (pilot code sequence c_(p)[n]). The k-th user aperiodic code sequence c_(k)[n] (pilot code sequence c_(p)[n]) is the concatenation of the corresponding Walsh-Hadamard spreading code and the base-station specific a periodic scrambling code.

Assume that the user terminal is equipped with M receive antennas and let h_(m)(t) denote the continuous-time channel from the base-station to the m-th receive antenna, including the transmit and receive filters. By sampling the different received antenna signals at the chiprate

$\begin{matrix} {Y_{\alpha} = \begin{bmatrix} {y\lbrack\alpha\rbrack} & {y\left\lbrack {\alpha + 1} \right\rbrack} & \cdots & {y\left\lbrack {\alpha + {BN} - 1} \right\rbrack} \\ \vdots & \vdots & \; & \vdots \\ {y\left\lbrack {\alpha + Q} \right\rbrack} & {y\left\lbrack {\alpha + Q + 1} \right\rbrack} & \cdots & {y\left\lbrack {\alpha + Q + {BN} - 1} \right\rbrack} \end{bmatrix}} & (48) \end{matrix}$

$\frac{N}{T},$ we obtain the following received vector sequence: y[n]=[y₁[n]y₂[n] . . . y_(M)[n]]^(T) which can be written as:

$\begin{matrix} {{y\lbrack n\rbrack} = {{\sum\limits_{n^{\prime} = 0}^{L}{{h\left\lbrack n^{\prime} \right\rbrack}{x\left\lbrack {n - n^{\prime}} \right\rbrack}}} + {e\lbrack n\rbrack}}} & (47) \end{matrix}$ where e[n] is similarly defined as y[n] and h[n] is the discrete-time M×1 vector channel from the base-station to the M receive antennas, given by: h[n]=[h₁[n]h₂[n] . . . h_(M)[n]]^(T) In this equation

${h_{m}\lbrack n\rbrack} = {h_{m}\left( {n\frac{N}{T}} \right)}$ is the discrete-time channel impulse response from the base-station to the m-th receive antenna. Note that we model h[n] as an M×1 FIR vector filter of order L.

H.1.b Data model for block processing. Let us now introduce the (Q+1)M×BN output matrix Y_(a) (with Hankel structure), shown in equation VIII-I.1.b, where B is the block length, a is the processing delay and Q+1 is the temporal smoothing factor. This output matrix can be written as: Y _(a) =HX _(a) +E _(a)  (49) where the noise matrix E_(a) is similarly defined as Y_(a) and H is the (Q+1)×r(r=L+Q+1) channel matrix (with Toeplitz structure). The r×BN input matrix X_(a) (with Hankel structure) is given by: X_(a)=[x_(a−L) ^(T) . . . x_(a+Q) ^(T)]^(T) with the transmitted multi-user chip sequence vector at delay a: x _(a) =[x[a]x[a+1] . . . x[a+BN−1]] H.2 Semi-blind Chip Equalizer Receiver

In this section, we discuss a new semi-blind chip equalizer receiver, that exploits all code information on one hand and pilot symbol information on the other hand. This receiver is based on the fully blind chip equalizer receiver for the reverse link of WCDMA systems, presented in the second part of [28]. One algorithm for block processing and one for adaptive processing is derived.

H.2.a Block processing. The block processing algorithm for the semi-blind chip equalizer detects B data symbols at once. We can write the transmitted multi-user chip sequence at delay a=0 as follows: x ₀ =s _(d) C _(d) +s _(p) C _(p)  (50) where s_(d) is the 1×KB total transmitted data symbol vector: s_(d)=[s₁ . . . s_(K)] and s_(k) is the k-th user's 1×B transmitted data symbol vector: s _(k) =[s _(k)[0]s _(k)[1] . . . s _(k) [B−1]] The transmitted pilot symbol vector s_(p) is similarly defined as s_(k). The KB×BN user code sequence matrix C_(d) stacks the code sequence matrices of the individual users: C_(d)=[C₁ ^(T) . . . C_(K) ^(T)]^(T) where C_(k) is the k-th user's B×BN code sequence matrix:

$C_{k} = \begin{bmatrix} {c_{k}\lbrack 0\rbrack} & \; & \; \\ \; & ⋰ & \; \\ \; & \; & {c_{k}\left\lbrack {B - 1} \right\rbrack} \end{bmatrix}$ and c_(k)[i] is the k-th user's code sequence vector used to spread the data symbol s_(k)[i]: c _(k) [i]=[c _(k) [iN] . . . c _(k) [iN+N−1]] The B×BN pilot code sequence matrix C_(p) and the pilot code sequence vector c_(p)[i] are similarly defined as C_(k) respectively c_(k)[i].

The vector x₀ is a row of every input matrix from the set {X_(a)}_(a=−Q) ^(L) and is therefore ‘contained’ in every output matrix from the set {Y_(a)}_(a=−Q) ^(L). The semi-blind block processing problem addressed here is to compute the desired user's data symbol sequence s₁ (we assume the first user to be the user of interest) from Y_(a), with −Q≦a≦L, based on the knowledge of the user code sequence matrix C_(d), the pilot code sequence matrix C_(p) and the pilot symbol vector s_(p). In order to solve this problem we make the following rather standard assumptions:

Assumption 3: The channel matrix H has full column rank r.

Assumption 4: The input matrix X_(a) has full row rank r.

The first assumption requires that: (Q+1)(M−1)≧L Therefore the number of antennas should be at least M=2. The second assumption, on the other hand, requires that: BN≧r

Let us first, for the sake of clarity, assume there is no additive noise present in Y_(a) (−Q≦a≦L). Because of assumptions 5 and 6, the rows of Y_(a) span the row space of X_(a). Hence, there exists a 1×(Q+1)M linear chip equalizer f_(a), for which: f _(a) Y _(a) −x ₀=0 and this linear chip equalizer f_(a) is a ZF linear chip equalizer with (Q+1)M−r degrees of freedom (hence, this linear chip equalizer is only unique when (Q+1)M=r). Using Equation 50 we can then write: f _(a) Y _(a) −s _(d) C _(d) −s _(p) C _(p)=0  (51) In order to guarantee the uniqueness of the solution for f_(a), s_(d) up to a complex scaling factor, the matrix [X_(a) ^(T)−C_(d) ^(T)−(s_(p)C_(p))^(T)]^(T) should have at most a one-dimensional left null space. This leads to the following identifiability condition: B(N−K)≧r  (52) Therefore the maximum number of users that can be supported is K=N−1.

Let us now assume that additive noise is present in Y_(a) (−Q≦a≦L). We then solve the following Least Squares (LS) minimisation problem:

$\begin{matrix} {{\overset{\_}{f}}_{a},{{\overset{\_}{s}}_{d} = {\arg{\;\;}{\min\limits_{f_{a},s_{d}}{{{f_{a}Y_{a}} - {s_{d}C_{d}} - {s_{p}C_{p}}}}^{2}}}}} & (53) \end{matrix}$ Since the LS cost function is a quadratic form in f_(a), s_(d), the minimisation can be done independently for f_(a) and s_(d). In order to obtain a direct semi-blind equalizer estimation, we first solve for s_(d), assuming f_(a) to be known and fixed. The LS solution for s_(d) can be simplified to: s _(d)=f_(a)Y_(a)C_(d) ^(H)  (54) because C_(d)C_(d) ^(H)=I_(KB) and C_(p)C_(d) ^(H)=0_(B×KB) due to the orthogonality of the user code sequences and the pilot code sequence at each symbol instant. Substituting s _(d) into the original LS problem (see Equation 53) leads to a modified LS problem in f_(a):

$\begin{matrix} {{\overset{\_}{f}}_{a} = {\arg\mspace{14mu}{\min\limits_{f_{a}}{{{f_{a}{Y_{a}\left( {I_{BN} - {C_{d}^{H}C_{d}}} \right)}} - {s_{p}C_{p}}}}^{2}}}} & (55) \end{matrix}$ which can be interpreted as follows. The equalized signal f_(a)Y_(a) is projected on the orthogonal complement of the space spanned by the user code sequences. Furthermore, the projected equalized signal f_(a)Y_(a) (I_(BN)−C_(d) ^(H)C_(d)) should be as close as possible to the transmitted pilot chip sequence s_(p)C_(p) in a Least Squares sense. Eventually, the LS solution for f_(a) can be written as: f _(a) =s _(p) C _(p) Y _(a) ^(H) {Y _(a)(I _(BN) −C _(d) ^(H) C _(d))Y _(a) ^(H)}⁻¹  (56)

H.2.b Adaptive processing. In this subsection, we derive a Square-Root Information (SRI) RLS type of adaptive algorithm for the semi-blind chip equalizer. By having a closer look at

$\begin{matrix} \left. \left\lbrack {\frac{R_{a}\lbrack i\rbrack}{z_{a}\lbrack i\rbrack}{\frac{0_{{({Q + 1})}M \times 1}}{*}}\frac{0_{{({Q + 1})}M \times {({N - 1})}}}{0_{1 \times {({N - 1})}}}} \right\rbrack\leftarrow{\left\lbrack {\frac{\lambda\;{R_{a}\left\lbrack {i - 1} \right\rbrack}}{\lambda\;{z_{a}\left\lbrack {i - 1} \right\rbrack}}❘\frac{{Y_{a}\lbrack i\rbrack}{{\overset{\sim}{C}}_{d}\lbrack i\rbrack}}{{s_{p}\lbrack i\rbrack}{c_{p}\lbrack i\rbrack}}} \right\rbrack \cdot {T_{a}\lbrack i\rbrack}} \right. & (57) \end{matrix}$ Equation 55, we notice that each new incoming (Q+1)M×N output matrix block Y_(a)[i] is first projected on the orthogonal complement of the space spanned by the user code sequences, by using the projection matrix {tilde over (C)}_(d)[i]:

${{\overset{\sim}{C}}_{d}\lbrack i\rbrack} = {I_{N} - {\sum\limits_{k = 1}^{K}{{c_{k}\lbrack i\rbrack}^{H}{c_{k}\lbrack i\rbrack}}}}$ The new projected output matrix block Y_(a)[i]{tilde over (C)}_(d)[i] is then used together with the new transmitted pilot chip sequence vector s_(p)[i]c_(p)[i] in a QRD-updating step, shown in Equation 57. In this equation, λ is the forget factor, that should be chosen in correspondence with the coherence time of the time-varying channel. The QRD-updating step tracks a lower-triangular factor R_(a)[i] and a corresponding right-hand side z_(a)[i]. The new value for the semi-blind chip equalizer f_(a)[i] then follows from the backward substitution step: f _(a) [i]·R _(a) [i]=z _(a) [i]  (58) H.3 Training-based Chip Equalizer Receiver

In this section, we discuss a training-based chip equalizer receiver [25], that exploits (besides the desired user's code information) only pilot code information on one hand and pilot symbol information on the other hand. Again, one algorithm for block processing and one for adaptive processing is derived.

H.3.a Block processing. The block processing algorithm for the training-based chip equalizer, that detects B data symbols at once, can again be formulated as a LS minimisation problem:

$\begin{matrix} {{\overset{\_}{g}}_{a} = {\arg\mspace{14mu}{\min\limits_{g_{a}}{{{g_{a}Y_{a}C_{p}^{H}} - s_{p}}}^{2}}}} & (59) \end{matrix}$ which can be interpreted as follows. The equalized signal g_(a)Y_(a) is despread with the pilot code sequence matrix C_(p). The equalized signal after despreading g_(a)Y_(a)C_(p) ^(H) should then be as close

$\begin{matrix} \left. \begin{bmatrix} {{\hat{R}}_{a}\lbrack i\rbrack} & 0_{{({Q + 1})}M \times 1} \\ {{\hat{z}}_{a}\lbrack i\rbrack} & \bigstar \end{bmatrix}\leftarrow{\quad{\begin{bmatrix} {\lambda\;{{\hat{R}}_{a}\left\lbrack {i - 1} \right\rbrack}} & {{Y_{a}\lbrack i\rbrack}{c_{p}\lbrack i\rbrack}^{H}} \\ {\lambda\;{{\hat{z}}_{a}\left\lbrack {i - 1} \right\rbrack}} & {s_{p}\lbrack i\rbrack} \end{bmatrix} \cdot {{\hat{T}}_{a}\lbrack i\rbrack}}} \right. & (62) \end{matrix}$ as possible to the transmitted pilot symbol vector s_(p) in a Least Squares sense. The LS solution for g_(a) can be written as: g _(a) =s _(p) C _(p) Y _(a) ^(H) {Y _(a)(C _(p) ^(H) C _(p))Y _(a) ^(H)}⁻¹  (60) In order to guarantee the uniqueness of the solution for g_(a) up to a complex scaling factor, the matrix [(X_(a)C_(p) ^(H))^(T)−s_(p) ^(T)]^(T) should have at most a one-dimensional left null space. This leads to the following identifiability condition: B≧r  (61) Note that when K=N−1, I_(BN)−C_(d) ^(H)C_(d)=C_(p) ^(H)C_(p). This means that for a fully loaded system (K=N−1) the semi-blind method is exactly the same as the training-based method. This is also indicated by the identifiability conditions 52 and 61.

H.3.b Adaptive processing. In this subsection, we derive an SRI-RLS type of adaptive algorithm for the training-based chip equalizer. By having a closer look at Equation 59, we notice that each new incoming (Q+1)M×N output matrix block Y_(a)[i] is first despread with the pilot code sequence vector c_(p)[i]. The new despread output matrix block Y_(a)[i]c_(p)[i]^(H) is then used together with the new pilot symbol s_(p)[i] in a QRD-updating step, shown in Equation 62. In this equation λ is again the forget factor. The QRD-updating step tracks a lower-triangular factor {circumflex over (R)}_(a)[i] and a corresponding right-hand side {circumflex over (z)}_(a)[i]. The new value for the training-based chip equalizer g_(a)[i] then follows from the backward substitution step: g _(a) [i]·{circumflex over (R)} _(a) [i]={circumflex over (z)} _(a) [i]  (63) H.4 Conclusion

We have developed new training-based and semi-blind space-time chip equalizer receivers for the forward link of WCDMA systems employing a continous code-multiplexed pilot. The proposed receivers can track fast fading multipath channels and outperform the conventional RAKE receiver with perfect channel knowledge. For full system load, the training-based and the semi-blind approach have exactly the same performance. However, for low to medium system load, the semi-blind approach outperforms the training-based approach.

I. Embodiment

I.1 WCDMA Forward Link Data Model

I.1.a Multi-channel framework. Let us consider the forward link of a single-cell WCDMA system with K active user terminals. The base-station transmits a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific a periodic scrambling code. The transmitted multi-user chip sequence can then be written as:

$\begin{matrix} {{x\lbrack n\rbrack} = {\sum\limits_{k = 1}^{K}{{s_{k}\lbrack i\rbrack}{c_{k}\left\lbrack {n\mspace{11mu}{mod}\mspace{11mu}\rho\; N} \right\rbrack}}}} & (64) \end{matrix}$ with

$i = {\left\lfloor \frac{n}{N} \right\rfloor.}$ Each user's symbol sequence is transmitted in blocks of B=P+D symbols, where the first P symbols are known pilot symbols and the final D symbols are unknown data symbols. The pilot symbols correspond to the Dedicated Physical Control CHannel (DPCCH) whereas the data symbols correspond to the Dedicated Physical Data CHannel (DPDCH) in the UTRA specification for 3G mobile communications [11]. Each user's symbol sequence s_(k)[i] is spread by a factor N with the length-ρN user code sequence c_(k)[n]. The k-th user aperiodic code sequence is the multiplication of the user specific Walsh-Hadamard spreading code and the base-station specific aperiodic scrambling code.

We assume that the user terminal is equipped with M receive antennas. The received antenna signals are sampled at the chiprate

$\frac{1}{T_{c}}$ and the obtained samples are stacked in the M×1 received vector sequence y[n]=[y₁[n] y₂[n] . . . y_(M)[n]]^(T), which can be written as:

$\begin{matrix} {{y\lbrack n\rbrack} = {{\sum\limits_{n^{\prime} = 0}^{L}{{h\left\lbrack n^{\prime} \right\rbrack}{x\left\lbrack {n - n^{\prime}} \right\rbrack}}} + {e\lbrack n\rbrack}}} & (65) \end{matrix}$ where e[n] is the M×1 received noise vector sequence and h[n]=[h₁[n] h₂[n] . . . h_(M)[n]]^(T) is the M×1 discrete-time vector channel from the base-station to the M receive antennas. Note that we model h[n] as an M×1 FIR vector filter of order L.

I.1.b Data model for block processing. Let us now introduce the following (Q+1)M×BN output matrix Y_(a) (with Hankel structure):

$Y_{a} = \begin{bmatrix} {y\lbrack a\rbrack} & \cdots & {y\left\lbrack {a + {BN} - 1} \right\rbrack} \\ \vdots & \; & \vdots \\ {y\left\lbrack {a + Q} \right\rbrack} & \cdots & {y\left\lbrack {a + Q + {BN} - 1} \right\rbrack} \end{bmatrix}$ where B is the block length, a is the processing delay and Q+1 is the temporal smoothing factor. This output matrix can be written as: Y _(a) =HX _(a) +E _(a)  (66) where the noise matrix E_(a) is similarly defined as Y_(a), the (Q+1)M×r channel matrix H (with block Toeplitz structure) is given by:

$\mathcal{H} = \begin{bmatrix} {h\lbrack L\rbrack} & \cdots & {h\lbrack 0\rbrack} & 0 & \cdots & 0 \\ 0 & {h\lbrack L\rbrack} & \cdots & {h\lbrack 0\rbrack} & \cdots & 0 \\ \; & \; & ⋰ & \; & ⋰ & \; \\ 0 & \cdots & 0 & {h\lbrack L\rbrack} & \cdots & {h\lbrack 0\rbrack} \end{bmatrix}$ and the r×BN input matrix X_(a) (with Hankel structure) is defined by: X _(a) =[x _(a−L) ^(T) . . . x _(a+Q) ^(T)]^(T) with the transmitted multi-user chip sequence vector x_(a), starting at delay a, given by: x _(a) =[x[a]x[a+1] . . . x[a+BN−1]]  (67) r=L+1+Q is called the system order. In order to guarantee the existence of Zero-Forcing (ZF) chip equalizers, we make the following rather standard assumptions about the data model:

Assumption 5: The channel matrix H has full column rank r.

Assumption 6: The input matrix X_(a) has full row rank r.

The first assumption requires that: (Q+1)(M−1)≧L  (68) which states that the number of receive antennas should be at least M=2 and correspondingly the temporal smoothing factor Q+1 should be at least the channel order L. The second assumption requires that: BN≧r  (69) which states that the number of observed chip samples BN should be larger than the system order r. I.2 Space-time Block Chip Equalizer Receivers

In this section we develop a DPCCH-trained and an enhanced DPCCH-trained chip equalizer receiver for the mobile user terminal that can detect the desired user's data symbols from the received base-station signal. Both chip equalizer receivers consist of a linear space-time chip equalizer followed by a correlator. The space-time chip equalizer tries to restore the transmitted multi-user chip sequence by linearly combining the discrete-time signals from the different receive antennas. The correlator descrambles and despreads the equalized signal with the desired user's aperiodic code sequence and generates the soft decisions about the desired user's data symbols. These soft decisions are then input to a decision device that generates the final hard decisions.

The DPCCH-trained and the enhanced DPCCH-trained chip equalizer receivers differ in the amount of a-priori information they assume to determine their equalizer coefficients. In the following, we will describe for both receivers a Least Squares (LS) block processing algorithm, that addresses a block of B symbols at once.

I.2.a Preliminary definitions. The transmitted multi-user chip sequence vector x₀, starting at delay a=0, consists of two parts: a first part corresponding to the known pilot symbols of the DPCCH and a second part corresponding to the unknown data symbols of the DPDCH. Using Equation 1 and 67, we can write the 1×BN transmitted multi-user chip sequence vector x₀ as follows: x ₀ =s _(p) C _(p) +s _(d) C _(d)  (70) where s_(p) is the 1×KP total pilot symbol vector that stacks the pilot symbol vectors of the different active users: s_(p)=[s_(p,1) . . . s_(p,K)] and s_(p,k) is the 1×P pilot symbol vector that stacks the pilot symbols of the k-th user: s _(p,k) =[s _(k)[0]s _(k)[1] . . . s _(k) [P−1]] The 1×KD total data symbol vector s_(d) and the 1×D data symbol vector of the k-th user s_(d,k) are similarly defined as s_(p) respectively s_(p,k). The KP×BN total pilot code sequence matrix C_(p) stages the pilot code sequence matrices of the different active users: C_(p)=[C_(p,1) ^(T) . . . C_(p,K) ^(T)]^(T) where C_(p,k) is the P×BN pilot code sequence matrix of the k-th user: C _(p,k) =[ C _(p,k)|0_(P×DN)] and C _(p,k) is the P×PN effective pilot code sequence matrix of the k-th user that stacks the k-th user's code sequence vectors used to spread its pilot symbols:

${\overset{\_}{C}}_{p,k} = \begin{bmatrix} {c_{k}\lbrack 0\rbrack} & \; & \; \\ \; & ⋰ & \; \\ \; & \; & {c_{k}\left\lbrack {P - 1} \right\rbrack} \end{bmatrix}$ The 1×N code sequence vector of the k-th user c_(k)[i] is used to spread the data symbol s_(k)[i]: c _(k) [i]=[c _(k)[(i mod ρ)N] . . . c _(k)[(i mod ρ)N+N−1]] The KP×PN total effective pilot code sequence matrix C _(p) stacks the effective pilot code sequence matrices of the different active users: C _(p)=[ C _(p,1) ^(T) . . . C _(p,K) ^(T)]^(T) The KD×BN total data code sequence matrix C_(d) and the D×BN data code sequence matrix of the k-th user C_(d,k) are similarly defined as C_(p) respectively C_(p,k). The D×DN effective data code sequence matrix of the k-th user C _(d,k) and the KD×DN total effective data code sequence matrix C _(d) are similarly defined as C _(p,k) respectively C _(p).

The vector x₀ is a row of every input matrix from the set {X_(a)}_(a=−Q) ^(L) and is therefore contained in every output matrix from the set {Y_(a)}_(a=−Q) ^(L). Note that the output matrix Y_(a) can also be partitioned into two parts: Y_(a)=[Y_(p,a)Y_(d,a)] where the (Q+1)M×PN pilot part Y_(p,a) corresponds to the DPCCH and the (Q+1)M×DN data part Y_(d,a) corresponds to the DPDCH.

I.2.b DPCCH-trained block chip equalizer receiver. The DPCCH-trained block chip equalizer receiver estimates the desired user's data symbol vector s_(d,k) (we assume the desired user to be the k-th user of the base-station multiplex) from Y_(a), with −Q≦a≦L, based on the knowledge of the desired user's data code sequence matrix C_(d,k), the desired user's pilot code sequence matrix C_(p,k) and the desired user's pilot symbol vector s_(p,k).

We assume first, for the sake of clarity, there is no additive noise present in Y_(a) (−Q≦a≦L). Because of assumptions 5 and 6, the rows of Y_(a) span the row space of X_(a). Hence, there exists a 1×(Q+1)M linear DPCCH-trained chip equalizer g_(a), for which: g _(a) Y _(a) −x ₀=0 where g_(a) is a ZF linear chip equalizer with (Q+1)M−r degrees of freedom. By despreading the above equation with the desired user's pilot code sequence matrix C_(p,k) and by using Equation 70 we can then write: g _(a) Y _(a) C _(p,k) ^(H) −s _(p,k)=0 because of the orthogonality between the data code sequence matrices C_(d,k) (k=1 . . . K) and the pilot code sequence matrices C_(p,k) (k=1 . . . K) of the different active users.

Let us now assume that additive noise is present in Y_(a) (−Q≦a≦L). We then solve the following Least Squares (LS) minimisation problem:

$\begin{matrix} {{\overset{\_}{g}}_{a} = {\arg\;{\min\limits_{g_{a}}{{{g_{a}Y_{a}C_{p,k}^{H}} - s_{p,k}}}^{2}}}} & (72) \end{matrix}$ which can be interpreted as follows. The equalized signal g_(a)Y_(a) is first despread with the desired user's pilot code sequence matrix C_(p,k). The equalized signal after despreading g_(a)Y_(a)C_(p,k) ^(H) should then be as close as possible to the desired user's known pilot symbol vector s_(p,k) in a Least Squares sense.

It is easy to prove that the LS problem in Equation 72 can be rewritten as follows:

$\begin{matrix} {{\overset{\_}{g}}_{a} = {\arg\;\underset{g_{a}}{\;\min}\;{{{g_{a}Y_{p,a}{\overset{\_}{C}}_{p,k}^{H}} - s_{p,k}}}^{2}}} & (73) \end{matrix}$ showing that the DPCCH-trained LS problem only exploits the pilot part Y_(p,a) of the output matrix Y_(a).

The LS solution for g_(a) can be written as: g _(a) =s _(p,k) C _(p,k) Y _(a) ^(H) {Y _(a)(C _(p,k) ^(H) C _(p,k))Y _(a) ^(H)}⁻¹  (74) The obtained LS DPCCH-trained chip equalizer g _(a) may subsequently be used to extract the 1×D desired user's vector of soft symbol decisions s _(d,k): s _(d,k)= g _(a)Y_(a)C_(d,k) ^(H)  (7) Finally, the soft decisions s _(d,k) are fed into a decision device that determines the nearest constellation point. This results in the hard decisions ŝ_(d,k).

I.2.c Enhanced DPCCH-trained block chip equalizer receiver. The enhanced DPCCIH-trained block chip equalizer receiver estimates the desired user's data symbol vector s_(d,k) from Y_(a), with −Q≦a≦L, based on the knowledge of the total data code sequence matrix C_(d), the total pilot code sequence matrix C_(p) and the total pilot symbol vector s_(p).

We assume first, for the sake of clarity, there is no additive noise present in Y_(a) (−Q≦a≦L). Because of assumptions 5 and 6, the rows of Y_(a) span the row space of X_(a). Hence, there exists a 1×(Q+1)M linear enhanced DPCCH-trained chip equalizer f_(a), for which: f _(a) Y _(a) −x ₀=0 where f_(a) is a ZF linear chip equalizer with (Q+1)M−r degrees of freedom. Using Equation 70 we can then write: f _(a) Y _(a) −s _(d) C _(d) −s _(p) C _(p)=0  (76)

Let us now assume that additive noise is present in Y_(a) (−Q≦a≦L). We then solve the following Least Squares (LS) minimisation problem:

$\begin{matrix} {{\overset{\_}{f}}_{a},{{\overset{\_}{s}}_{d} = {\arg\;{\min\limits_{f_{a},s_{d}}{{{f_{a}Y_{a}} - {s_{d}C_{d}} - {s_{p}C_{p}}}}^{2}}}}} & (77) \end{matrix}$ Since the LS cost function is a quadratic form in f_(a),s_(d), the minimisation can be done independently for f_(a) and s_(d). In order to obtain a direct equalizer estimation, we first solve for s_(d), assuming f_(a) to be known and fixed. The LS solution for s_(d) can be simplified to: s _(d)f_(a)Y_(a)C_(d) ^(H)  (78) because of the orthogonality between the data code sequence matrices C_(d,k) (k=1 . . . K) and the pilot code sequence matrices C_(p,k) (k=1 . . . K) of the different active users. Substituting s _(d) into the original LS problem of Equation 77, leads to a modified LS problem in f_(a):

$\begin{matrix} {{\overset{\_}{f}}_{a} = {\arg\underset{f_{a}}{\;\min}\;{{{f_{a}{Y_{a}\left( {I_{BN} - {C_{d}^{H}C_{d}}} \right)}} - {s_{p}C_{p}}}}^{2}}} & (79) \end{matrix}$ which can be interpreted as follows. The equalized signal f_(a)Y_(a) is projected on the orthogonal complement of the subspace spanned by the total data code sequence matrix C_(d). The equalized signal after projecting f_(a)Y_(a)(I_(BN)−C_(d) ^(H)C_(d)) should then be as close as possible to the known total pilot chip sequence vector s_(p)C_(p) in a Least Squares sense.

It is easy to prove that the modified LS problem of Equation 79 can be rewritten as in Equation 80 (see the top of the next page) showing that the enhanced DPCCH-trained LS problem naturally decouples into two different parts: a fully-trained part and a fully-blind part. On the one hand, the fully-trained part (corresponding to the first term in Equation 80)

$\begin{matrix} {{\overset{\_}{f}}_{a} = {\arg\underset{f_{a}}{\;\min}\mspace{11mu}\left\{ {{{{f_{a}Y_{p,a}} - {s_{p}{\overset{\_}{C}}_{p}}}}^{2} + {{f_{a}{Y_{d,a}\left( {I_{DN} - {{\overset{\_}{C}}_{d}^{H}{\overset{\_}{C}}_{d}}} \right)}}}^{2}} \right\}}} & (80) \end{matrix}$ forces the equalized signal to be as close aspossible (in a Least Squares sense) to the transmitted multi-user chip sequence during the training mode of the receiver. On the other hand, the fully-blind part (corresponding to the second term of Equation 80) projects the equalized signal on the orthogonal complement of the subspace spanned by the total effective data code sequence matrix during the blind mode of the receiver. Furthermore, the energy in the projected equalized signal should be as small as possible (in a Least Squares sense) which actually corresponds to a Minimum Output Energy (MOE) criterion. For this reason, the enhanced DPCCH-trained chip equalizer is actually a semi-blind chip equalizer.

The LS solution for f_(a) can be written as: f _(a) 32 s _(p) C _(p) Y _(a) ^(H) {Y _(a)(I _(BN) −C _(d) ^(H) C _(d))Y _(a) ^(H)}⁻¹  (81) The obtained LS enhanced DPCCH-trained chip equalizer f _(a) may subsequently be used to extract the 1×D desired user's vector of soft symbol decisions s _(d,k). Equation 75 remains valid if we replace g _(a) by f _(a). I.3 Space-time Adaptive Chip Equalizer Receivers

In this section, we derive Square Root Information (SRI) Recursive Least Squares (RLS) type [24] of adaptive chip equalizer receivers from the corresponding LS block chip equalizer receivers. These receivers address a single symbol at once and consist of two parts: a DPCCH-aided updating part and a user specific detection part.

I.3.a DPCCH-trained adaptive chip equalizer receiver. The DPCCH-trained updating part can work in two modes: a training mode and a decision-directed mode. By having a closer look at the corresponding LS block processing algorithm in Equation 73, it is rather straightforward to derive an SRI-RLS type of adaptive processing algorithm. The (Q+1)M×N new incoming output matrix block Y_(a)[i] is first despread with the desired user's code sequence vector for the i-th symbol instant c_(k)[i]. The new despread output matrix block Y_(a)[i]c_(k) ^(H)[i] is then used together with the new desired user's pilot symbol s_(k)[i] (during the training mode) or the new desired user's estimated data symbol ŝ_(k)[i] (during the decision-directed mode) in a QRD-updating step [24]. The QRD-updating step tracks a (Q+1)M×(Q+1)M lower triangular factor R_(a)[i] and a corresponding 1×(Q+1)M right-hand side z_(a)[i]. The new values for the equalizer coefficients then follow from the backsubstitution step: g _(a) [i]·R _(a) [i]=z _(a) [i]  (82)

I.3.b Enhanced DPCCH-trained adaptive chip equalizer receiver. The enhanced DPCCH-trained updating part can also work in two modes: a training mode and a blind mode. By having a closer look at the corresponding LS block processing algorithm in Equation 80, it is rather straightforward to derive an SRI-RLS type of adaptive processing algorithm. During the training mode, the new incoming output matrix block Y_(a)[i] is used together with the 1×N new multi-user chip sequence vector x₀[i] in a QRD-updating step. During the blind mode, the new incoming output matrix block Y_(a)[i] is first projected on the orthogonal complement of the subspace spanned by the active user codes, using the N×N projection matrix I_(N)−C_(d) ^(H)[i]C_(d)[i]. The new projected output matrix block is then used together with the 1×N zero vector 0_(1×N) in a QRD-updating step [24].

I.4 Simulation Results

The simulations are performed for the forward link of a WCDMA system with K active, equal power users, QPSK data modulation, real orthogonal Walsh-Hadamard spreading codes of length N=8 along with a random scrambling code whose period measures ρ=B symbols. The number of receive antennas M=2, the block length B=200 and the number of pilot symbols P=15. The vector channel with order L=3 has M(L+1)=8 Rayleigh distributed channel taps of equal average power. The temporal smoothing factor is chosen to be Q=L and the performance sa averaged over 100 randomly generated channels.

FIG. 18 compares the average BER versus the average SNR per bit of the conventional RAKE receiver with perfect channel knowledge, the DPCCH-trained (DPCCH-T CE) and the enhanced DPCCH-trained chip equalizer (E-DPCCH-T CE) receiver for half system load (K=4). Also shown in the figure is the BER-curve of the ideal fully-trained chip equalizer (F-T CE) receiver that assumes knowledge of the total transmitted multi-user chip sequence and the theoretical BER-curve of QPSK with M(L+1)-th order diversity in Rayleigh fading channels (single-user bound) [27]. The ideal RAKE receiver dearly exhibits an error floor due to the MUI while both the DPCCH-T and the F-DPCCH-T CE receiver are near/far resistant. The E-DPCCH-T CE receiver realizes a 5 dB gain at a BER of 10⁻² compared to the regular DPCCH-T CE receiver. Moreover, its performance nearly approaches that of the ideal F-T CE receiver.

FIG. 19 shows the same curves but now for a fully loaded system (K=8). Although the performance curves shift up due to the increased MUI, the formerly obtained conclusions remain valid.

I.5 Conclusion

In this paper, we proposed new DPCCH-trained and enhanced DPCCH-trained chip equalizer receivers for the forward link of WCDMA systems employing a time-multiplexed pilot. In reduced as well as full load settings, the enhanced DPCCH-trained receiver outperforms the regular DPCCH-trained receiver and comes close to the performance of the ideal fully-trained receiver.

The LS block chip equalizer receivers on the one hand, that detect B symbols at once, are suited for packet-switched type of communication. The performance gain of the enhanced DPCCH-trained receiver compared to the regular DPCCH-trained receiver increases for decreasing pilot length P. The RLS adaptive chip equalizer receivers on the other hand, that detect symbol by symbol, are suited for circuit-switched type of communication. They are able to track time-varying multipath channels and outperform the conventional RAKE receiver with perfect channel knowledge.

We can conclude that the enhanced DPCCH-trained chip equalizer receiver is a promising receiver for future WCDMA terminals, from a performance as well as complexity point of view.

J. Embodiment

J.1 Forward Link DS-CDMA Data Model

Let us consider the forward link of a single-cell DS-CDMA system. The base-station transmits a synchronous code division multiplex, employing user-specific orthogonal Walsh-Hadamard spreading codes and a site-specific aperiodic scrambling code. The transmitted multi-user chip sequence during the i-th symbol period consists of K user signals and a continuous pilot signal

$\begin{matrix} {{x\left\lbrack {i,n} \right\rbrack} = {{\sum\limits_{k = 1}^{K}{A_{k} \cdot {b_{k}\lbrack i\rbrack} \cdot {c_{k}\left\lbrack {i,n} \right\rbrack}}} + {A_{p} \cdot {b_{p}\lbrack i\rbrack} \cdot {c_{p}\left\lbrack {i,n} \right\rbrack}}}} & (83) \end{matrix}$ with A_(k), b_(k)[i], c_(k)[i,n] and A_(p), b_(p)[i], c_(p)[i,n] the amplitude, the transmitted bit and the aperiodic spreading code of user k respectively the pilot signal during the i-th symbol period. c_(k)[i,n]=w_(k)[n]·s[i,n] for n=0 . . . N−1 with N the spreading factor, is the component wise multiplication of the Walsh-Hadamard spreading code w_(k)[n] and the aperiodic scrambling code s[i,n]. A similar equation also holds for the aperiodic spreading code of the pilot signal c_(p)[i,n]=w_(p)[n]·s[i,n]. The total transmitted multi-user chip sequence is then the sum of different time-shifted versions of x[i,n]:

$\begin{matrix} {{x\lbrack n\rbrack} = {\sum\limits_{i}{x\left\lbrack {i,{n - {Ni}}} \right\rbrack}}} & (84) \end{matrix}$

The impulse response of the time-varying composite channel during the i-th symbol period, consisting of the low-pass transmit and receive filters and the actual propagation channel, can be modeled as follows:

$\begin{matrix} {{h^{(i)}(t)} = {\sum\limits_{m = 0}^{M - 1}{{h^{(i)}\lbrack m\rbrack} \cdot {\Psi_{RC}\left( {t - \tau_{m}} \right)}}}} & (85) \end{matrix}$ with M the number of paths in the propagation channel. h^((i))[m] and τ_(m) are the time-varying complex channel gain respectively the delay of the m-th path. The composite chip waveform ψ_(RC)(t) includes both transmit and receive low-pass filters and has a raised-cosine spectrum.

The total received baseband signal at the mobile user terminal can be written as:

$\begin{matrix} {{{y(t)} = {\sum\limits_{i}{\sum\limits_{n}{x\left\lbrack {i,n} \right\rbrack}}}}{{\cdot {h^{(i)}\left( {t - {\left( {{Ni} + n} \right)T_{c}}} \right)}} + {\eta(t)}}} & (86) \end{matrix}$

$\begin{matrix} {\quad{{H_{q}\lbrack i\rbrack} = \left\lbrack \begin{matrix} {h_{q}\left\lbrack {i,L} \right\rbrack} & {h_{q}\left\lbrack {i,{L - 1}} \right\rbrack} & \cdots & {h_{q}\left\lbrack {i,0} \right\rbrack} & 0 & \cdots & 0 \\ 0 & {h_{q}\left\lbrack {i,L} \right\rbrack} & \cdots & {h_{q}\left\lbrack {i,1} \right\rbrack} & {h_{q}\left\lbrack {i,0} \right\rbrack} & \cdots & 0 \\ \vdots & \vdots & ⋰ & \vdots & ⋰ & ⋰ & \vdots \\ 0 & \cdots & 0 & {h_{q}\left\lbrack {i,L} \right\rbrack} & \cdots & {h_{q}\left\lbrack {i,1} \right\rbrack} & {h_{q}\left\lbrack {i,0} \right\rbrack} \end{matrix} \right\rbrack}} & (89) \end{matrix}$ where η(t) is a zero-mean complex colored Gaussian noise process with power spectral density σ². The noise coloring is due to the receive low-pass filter. The signal y(t) is now temporally oversampled at twice the chip-rate to obtain the first phase y₁[n]=y(nT_(c)) respectively the second phase

${y_{2}\lbrack n\rbrack} = {y\left( {\frac{T_{c}}{2} + {nT}_{c}} \right)}$ of the oversampled received signal. These discret-time signals correspond to the first phase h₁[i,n]=h^((i))(nT_(c)) respectively the second phase h₂[i,n]=

${h_{2}\left\lbrack {i,n} \right\rbrack} = {h^{(i)}\left( {\frac{T_{c}}{2} + {n\; T_{c}}} \right)}$ of the oversampled composite channel, that both have a channel order L.

We can now construct the (N+2F)-dimensional received signal vector for the first and the second phase, q=1 . . . 2, during the i-th symbol period y_(q)[i]=[y_(q)[iN−F] . . . y_(q)[(i+1)N−1+F]]^(T), with 2F+1 the total length of the equivalent symbol-spaced equalizers. The total received signal vector during the i-th symbol period y[i]=[y₁ ^(T)[i] y₂ ^(T)[i]]^(T) stacks the received signal vectors for both phases and relates to the (N+2F+L)-dimensional transmitted multi-user chip sequence vector x[i]=[x[iN−F−L] . . . x[(i+1)N−1+F]]^(T) through the following equation: y[i]=H[i]·x[i]+η[i]  (87) The total channel matrix H[i] stacks the channel matrices for the first respectively the second phase during the i-th symbol period:

$\begin{matrix} {{\mathcal{H}\lbrack i\rbrack} = \begin{bmatrix} {H_{1}\lbrack i\rbrack} \\ {H_{2}\lbrack i\rbrack} \end{bmatrix}} & (88) \end{matrix}$ while η[i] is the received noise vector during the i-th symbol period. H_(q)[i], for q=1 . . . 2 is a (N+2F)×(N+2F+L) Toeptitz-matrix, as shown in equation 89 on the next page.

Finally, the transmitted multi-user chip sequence vector relates to the transmitted bits through the following equation: x[i]=C[i]·A·b[i]  (90) The (3K+3)-dimensional bit vector b[i] contains the user and pilot transmitted bits during the i-th symbol period as well as the previous and the next symbol period. The (N+2F+L)×(3K+3) code matrix C[i] contains the aperiodic spreading codes corresponding to each of the transmitted bits while the (3K+3)-dimensional diagonal matrix A contains the amplitudes of the user signals and the pilot signal. J.2 Adaptive Chip Equalizer Receiver

The proposed pilot-aided adaptive chip equalizer receiver is well-suited for implementation in a mobile user terminal, operating in fast fading multipath channels. By exploiting the presence of a continuous pilot signal in forthcoming third generation cellular and LEO satellite communication systems, it continuously updates at the symbol rate using a simple NLMS or a more advanced RLS adaptive scheme. The proposed receiver basically consists of two parts: a user-specific detection part and a pilot-aided updating part.

J.2.a User-specific detection part. The user-specific detection part operates like the conventional chip equalizer receiver, as shown in FIG. 20. Initially, it uses for each symbol instant the corresponding equalizer coefficients, drawn from the pilot-aided updating part (see next subsection), to equalize the received signal. Finally, it descrambles and despreads the equalized signal with the aperiodic spreading code of the user of interest (user k) and makes a decision about the transmitted data symbol. Let g_(q) ^(H)[i]=[g_(q)[i,−F] . . . g_(q)[i,0] . . . g_(q)[i,+F]], for q=1 . . . 2 denote the equalizer coefficient vector for the first respectively the second phase of the received signal during the i-th symbol period. The total equalizer coefficient vector g^(H)[i]=[g₁ ^(H)[i] g₂ ^(H)[i]] stacks the equalizer coefficient vectors for the first and the second phase. The soft estimate of the k-th user data symbol during the i-th symbol period can then

$\begin{matrix} {{G_{q}\lbrack i\rbrack} = \begin{bmatrix} {g_{q}\left\lbrack {i,{- F}} \right\rbrack} & {g_{q}\left\lbrack {i,{{- F} + 1}} \right\rbrack} & \cdots & {g_{q}\left\lbrack {i,{+ F}} \right\rbrack} & 0 & \cdots & 0 \\ 0 & {g_{q}\left\lbrack {i,{- F}} \right\rbrack} & \cdots & {g_{q}\left\lbrack {i,{{+ F} - 1}} \right\rbrack} & {g_{q}\left\lbrack {i,{+ F}} \right\rbrack} & \cdots & 0 \\ \vdots & \vdots & ⋰ & \vdots & ⋰ & ⋰ & \vdots \\ 0 & \cdots & 0 & {g_{q}\left\lbrack {i,{- F}} \right\rbrack} & \cdots & {g_{q}\left\lbrack {i,{{+ F} - 1}} \right\rbrack} & {g_{q}\left\lbrack {i,{+ F}} \right\rbrack} \end{bmatrix}} & (93) \end{matrix}$ be written as follows:

$\begin{matrix} \begin{matrix} {{{\overset{\sim}{b}}_{k}\lbrack i\rbrack} = {{{c_{k}\lbrack i\rbrack} \cdot {G_{1}\lbrack i\rbrack} \cdot {y_{1}\lbrack i\rbrack}} + {{c_{k}\lbrack i\rbrack} \cdot {G_{2}\lbrack i\rbrack} \cdot {y_{2}\lbrack i\rbrack}}}} \\ {= {{c_{k}\lbrack i\rbrack} \cdot {G\lbrack i\rbrack} \cdot {y\lbrack i\rbrack}}} \end{matrix} & (91) \end{matrix}$ with c_(k)[i]=c_(k)[i,0] . . . c_(k)[i,N−1] the k-th user aperiodic spreading code vector. The total equalizer matrix G[i] simply stacks the equalizer matrices for both phases during the i-th symbol period: G[i]=[ G₁[i]G₂[i]]  (92) The equalizer matrix G_(q)[i], with q=1 . . . 2 for the first respectively the second phase is a N×(N+2F) Toeplitz-matrix of the corresponding equalizer coefficient vector, as shown in equation 93. Note that the equalized signal is common to all codes that have been assigned to the mobile user terminal in a multi-code system, so a single equalizer suffices.

J.2.b P_(i)lot-aided updating part. The updating part employs the pilot signal to adaptively update the equalizer coefficients at the symbol rate, as opposed to a classical chip equalizer that would be updated at the chip rate. The latter requires a continuous muliti-user training chip sequence for fast fading channel conditions, which is impossible to realize in practice.

By reversing the order of the equalization and the descrambling/despreading for the pilot signal, one obtains an elegant adaptive scheme operating at the symbol rate, as shown in FIG. 21. Starting from the original equation 91, applied to the pilot signal, the soft estimate

$\begin{matrix} {{C_{p}\lbrack i\rbrack} = \left\lbrack \begin{matrix} {c_{p}\left\lbrack {i,0} \right\rbrack} & {c_{p}\left\lbrack {i,1} \right\rbrack} & \cdots & {c_{p}\left\lbrack {i,{N - 1}} \right\rbrack} & 0 & \cdots & 0 \\ 0 & {c_{p}\left\lbrack {i,0} \right\rbrack} & \cdots & {c_{p}\left\lbrack {i,{N - 2}} \right\rbrack} & {c_{p}\left\lbrack {i,{N - 1}} \right\rbrack} & \cdots & 0 \\ \vdots & \vdots & ⋰ & \vdots & ⋰ & ⋰ & \vdots \\ 0 & \cdots & 0 & {c_{p}\left\lbrack {i,0} \right\rbrack} & \cdots & {c_{p}\left\lbrack {i,{N - 2}} \right\rbrack} & {c_{p}\left\lbrack {i,{N - 1}} \right\rbrack} \end{matrix} \right\rbrack} & (95) \end{matrix}$ of the pilot symbol during the i-th symbol period can be written as:

$\begin{matrix} \begin{matrix} {{{\overset{\sim}{b}}_{p}\lbrack i\rbrack} = {{{g_{1}^{H}\lbrack i\rbrack} \cdot {C_{p}\lbrack i\rbrack} \cdot {y_{1}\lbrack i\rbrack}} + {{g_{2}^{H}\lbrack i\rbrack} \cdot {C_{p}\lbrack i\rbrack} \cdot {y_{2}\lbrack i\rbrack}}}} \\ {= {{g^{H}\lbrack i\rbrack} \cdot \begin{bmatrix} {C_{p}\lbrack i\rbrack} & 0 \\ 0 & {C_{p}\lbrack i\rbrack} \end{bmatrix} \cdot {y\lbrack i\rbrack}}} \end{matrix} & (94) \end{matrix}$ with C_(p)[i] the descrambling/despreading matrix for the pilot signal, being a (2F+1)×(N+2F) Toeplitz-matrix of the pilot's aperiodic spreading code vector c_(p)[i]=[c_(p)[i,0] . . . c_(p)[i,N−1]], as shown in equation 95. For a particular symbol the pilot's descrambling/despreading matrix provides 2F+1 correlation values for each phase (so 2·(2F+1) in total), with F the single-sided symbol-spaced equalizer length. These values are the correlator outputs at the correct symbol instant and F chip periods before and after the correct symbol instant. The total equalizer coefficient vector g^(H)[i] (with 2·(2F+1) coefficients) coherently combines these correlation values to obtain an estimate of the transmitted pilot symbol. The optimal equalizer coefficients are determined iteratively by using a simple NLMS scheme or a more advanced RLS scheme [24]. J.3 Simulation Results

The simulations are performed for the forward link of a DS-CDMA system with K active, equal power users, BPSK data modulation, real orthogonal Walsh-Hadamard codes of length N=32 along with a Gold overlay code for scrambling. The period of the Gold overlay scrambling code measures 12 symbols or equivalently 384 chips. Both the first and second phase of the oversampled channel were modeled by L+1=2 (with L the channel order) independent time-varying Rayleigh-distributed channel taps of equal average power. The Symbol-Spaced (SS) Chip Equalizer (CE) receiver only uses the first phase while the Fractionally-Spaced (FS) Chip Equalizer (CE) receiver uses both phases of the received signal. The independence of the different channel taps of the different phases approximately corresponds to a roll-off factor α=1.

FIG. 22 compares the average BER versus the average SNR per bit of the RAKE receiver with perfect channel knowledge, the SS-CE receiver and the FS-CE receiver with both NLMS and RLS updating for quarter system load (K=7). Also shown on the figure is the theoretical BER-curve of BPSK with 2·(L+1)-th order diversity in Rayleigh fading channels (single-user bound) [27]. The normalized Doppler-spread of the time-varying channel was F_(d)·T_(S)=10⁻³, corresponding to a normalized coherence time

$\frac{T_{coh}}{T_{S}} = 1000.$ The stepsize of the NLMS algorithm was optimally chosen to be it μ=0.5. The optimal value of the RLS forgetting factor proved to be λ=0.9, corresponding to a data memory

$\frac{1}{1 - \lambda}\mspace{20mu}{being}\mspace{20mu}\frac{1}{100}$ of the normalized coherence time. Only for rather high SNR per bit, the SS-CE achieves better performance than the RAKE receiver. The FS-CE with RLS updating outperforms the corresponding SS-CE by 4 dB for a BER of 10⁻³ and approaches the single-user bound. Performance of NLMS updating is within one dB of that of RLS updating.

FIG. 23 and FIG. 24 show the same curves as before but now for half (K=15) respectively full (K=31) system load. For half system load, the SS-CE and the FS-CE with RLS updating outperform the RAKE by 4 dB respectively 7 dB for a BER of 10⁻². The performance of NLMS updating remains within one dB of that of RLS updating. For full system load, the RAKE clearly exhibits an error floor due to the increased MUI, while both the SS-CE and the FS-CE remain near/far resistant.

FIG. 25 shows the average BER versus the average SNR per bit of the FS-CE with RLS updating for different normalized Doppler spreads F_(d)·T_(s) ranging from 10⁻⁶, over 10⁻³ to 10⁻². The optimal values of λ were determined to be 0.9999, 0.9 respectively 0.8. The performance of the adaptive chip equalizer remains reasonable over a large range of the normalized Doppler spread. Only for a normalized Doppler spread of F_(d)·T_(s)=10⁻² the performance degrades significantly.

J.4 Conclusion

We have proposed a new pilot-aided adaptive fractionally-spaced receiver structure for the forward link of DS-CDMA systems employing aperiodic overlay scrambling codes and operating in fast fading multipath channels. By equalizing the multipath channel effects, the receiver restores the orthogonality between the different users and therefore suppresses the MUI. An NLMS or RLS symbol rate adaptation scheme, well-suited for fast fading channels, has been obtained by reversing the order of the equalizer and the descrambler/despreader for the updating part. Simulation results show that the proposed receiver outperforms the conventional RAKE receiver with perfect channel knowledge, for both slow and fast fading multipath channels.

K. Embodiment

Another embodiment of the invention is shown in FIG. 26. A block is transmitted from a base-station to a terminal (Step 120). The block comprises a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol. In the terminal, at least two independent signals that comprise at least a channel distorted version of the transmitted block are generated (Step 140). After the independent signals are generated, the two independent signals are combined with a combiner filter with filter coefficients which are determined by using the pilot symbol, thus a combined filtered signal is obtained (Step 160). Lastly, the combined filtered signal is despread and descrambled with a composite code of the base-station specific scrambling code and one of the user specific codes (Step 180). While the above description has pointed out novel features of the invention as applied to various embodiments, the skilled person will understand that various omissions, substitutions, and changes in the form and details of the device or process illustrated may be made without departing from the scope of the invention. Therefore, the scope of the invention is defined by the appended claims rather than by the foregoing description. All variations coming within the meaning and range of equivalency of the claims are embraced within their scope.

APPENDIX

-   [1] L. B. Milstein, “Wideband code division multiple access,” IEEE     Journal on Selected Areas in Communications, vol. 18, no. 8, pp.     1344–1354, August 2000. -   [2] R. Price and P. E. Green, “A communication technique for     multipath channels,” IRE, vol. 46, pp. 555–570, March 1958. -   [3] S. Verdú, Multiuser Detection, Cambridge University Press, 1998. -   [4] M. K. Tsatsanis, “Inverse filtering for CDMA systems,” IEEE     Transactions on Signal Processing, vol. 45, no. 1, pp. 102–112,     January 1997. -   [5] M. K. Tsatsanis and Z. Xu, “Performance analysis of minimum     variance receivers,” IEEE Transactions on Signal Processing, vol.     46, no. 11, pp. 3014–3022, November 1998. -   [6] X. Wang and H. V. Poor, “Blind equalization and multiuser     detection in dispersive CDMA channels,” IEEE Transactions on     Communications. vol.46, no.1; January 1998; p.91–103., vol. 46, no.     1, pp. 91–103, January 1998. -   [7] I. Ghauri and D. T. M. Slock, “Blind and semi-blind single-user     receiver techniques for asynchronous CDMA in multipath channels,” in     GLOBECOM, December 1998, vol. 6, pp. 3572–3577. -   [8] F. Petré, M. Engels, A. Bourdoux, B. Gyselinckx, M. Moonen,     and H. De Man, “Extended MMSE receiver for multiuser interference     rejection in multipath DS-CDMA channels,” in Vehicular Technology     Conference (VTC-Fall), September 1999, vol. 3, pp. 1840–1844. -   [9] D. Gesbert, J. Sorelius, P. Stoica, and A. Paulraj, “Blind     multiuser MMSE detector for CDMA signals in ISI channels,” IEEE     Communications Letters, vol. 3, no. 8, pp. 233–235, August 1999. -   [10] M. Honig and M. K. Tsatsanis, “Adaptive techniques for     multiuser CDMA receivers,” IEEE Signal Procssing Magazine, vol. 17,     no. 3, pp. 49–61, May 2000. -   [11] R. Holoma and A. Toskala, WCDMA for UMTS, Wiley, 2001. -   [12] A. Klein, “Data detection algorithms specially designed for the     downlink of CDMA mobile radio systems,” in VTC, May 1997, vol. 1,     pp. 203–207. -   [13] C. D. Frank and E. Visotsky, “Adaptive interference suppression     for Direct-Sequence CDMA systems with long spreading codes,” in     Allerton Conference on Communication, Control and Computing,     September 1998, pp. 411-420. -   [14] I. Ghauri and D. T. M. Slock, “Linear receivers for the DS-CDMA     downlink exploiting orthogonality of spreading sequences,” in     Asilomar Conference on Signals, Systems and Computers, November     1998, vol. 1, pp. 650–654. -   [15] K. Hooli, M. Latva-aho, and M. Juntti, “Multiple access     interference suppression with linear chip equalizers in WCDMA     downlink receivers,” in GLOBECOM, December 1999, vol. General     Conference (Part A), pp. 467–471. -   [16] T. P. Krauss, M. D. Zoltowski, and G. Leus, “Simple MMSE     equalizers for CDMA downlink to restore chip sequence: Comparison to     Zero-Forcing and RAKE,” in ICASSP, May 2000, vol. 5, pp. 2865–2868. -   [17] T. P. Krauss and M. D. Zoltowski, “Blind channel identification     on CDMA forward link based on dual antenna receiver at handset and     cross-relation,” in Asilomar Conference on Signals, Systems and     Computers, November 1999, vol. 1, pp 75–79. -   [18] K. Li and H. Liu, “A new blind receiver for downlink DS-CDMA     communications,” IEEE Communications Letters, vol. 3, no. 7, pp.     193–195, July 1999. -   [19] S. Muduloda and A. Paulraj, “A blnd multiuser receiver for the     CDMA downlink,” in ICASSP, May 2000, vol. 5, pp. 2933–2936. -   [20] R. G. Vaughan, “Polarization diversity in mobile     communications,” IEEE Transactions on Vehicular Technoloy, vol. 39,     no. 3, pp. 177–186, August 1990. -   [21] N. S. Correal and B. D. Woerner, “Enhanced DS-CDMA uplink     performance through base station polarization diversity and     multistage interference cancellation,” in GLOBECOM, December 1998,     vol. 4, pp. 1905–1910. -   [22] R. Fantacci and A. Galligani, “An efficient RAKE receiver     architecture with pilot signal cancellation for downlink     communications in DS-CDMA indoor wireless networks,” IEEE     Transactions on Communications, vol. 47, no. 6, pp. 823–827, June     1999. -   [23] D. Gesbert, P. Duhamel, and S. Mayrague, “On-line blind     multichannel equalization based on mutually referenced filters,”     IEEE Transactions on Signal Processing, vol. 45, no. 9, pp.     2307–2317, September 1997. -   [24] S. Haykin, Adaptive Filter Theory, Prentice Hall, 3 edition,     1996. -   [25] F. Petré, M. Moonen, M. Engels, B. Gyselinckx, and H. De Man,     “Pilot-aided adaptive chip equalizer receiver for interference     suppression in DS-CDMA forward link,” in Vehicular Technology     Conference (VTC-Fall), September 2000, vol. 1, pp. 303–308. -   [26] S. Buzzi and H. V. Poor, “Channel estimation and multiuser     detection in long-code DS/CDMA systems,” IEEE Journal on Selected     Areas in Communications, vol. 19, no. 8, pp. 1476–1487, August 2001. -   [27] J. G. Proakis, Digital Communications, McGraw-Hill, 3 edition,     1995. -   [28] Geert Leus, Signal Processing Algorithms for CDMA-based     Wireless Communications, Ph.D. thesis, KULeuven, May 2000. 

1. In a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising: transmitting a block from a basestation to a terminal, the block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol; and performing in the terminal: generating a plurality of independent signals having at least a channel distorted version of the transmitted block; combining the plurality of independent signals with a combiner filter with filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal; and despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific spreading codes.
 2. The method of claim 1, wherein the combining comprises filtering the plurality of independent signals with a filter having filter coefficients which are determined by using the pilot symbol to obtain filtered signals, and combining the filtered signals to obtain the combined filtered signal.
 3. The method of claim 1, wherein the filter coefficients are determined without channel estimation.
 4. The method of claim 1, wherein the filter coefficients are determined such that one version of the combined filtered signal is as close as possible to a version of the pilot symbol.
 5. The method of claim 4, wherein, in the transmitted block, the pilot symbol is spread by a pilot code.
 6. The method of claim 5, wherein the version of the combined filtered signal is the combined filtered signal after despreading with a composite code of the basestation specific scrambling code and the pilot code, and wherein the version of the pilot symbol is the pilot symbol.
 7. The method of claim 5, wherein the version of the combined filtered signal is the combined filtered signal, after projecting on the orthogonal complement on the subspace spanned by the composite codes of the base station specific scrambling code and the user specific codes, and wherein the version of the pilot symbol is the pilot symbol spread with a composite code of the base station specific scrambling code and the pilot code.
 8. The method of claim 4, wherein the transmitted block comprises a plurality of pilot symbols, each being spread by a user specific code.
 9. The method of claim 8, wherein the version of the combined filtered signal is the combined filtered signal after despreading with a composite code of the basestation specific scrambling code and a user specific code, and wherein the version of the pilot symbol is the pilot symbol.
 10. The method of claim 8, wherein the version of the combined filtered signal is the pilot portion of the combined filter signal, and wherein the version of the pilot symbol is the sum of the spread pilot symbols within the transmitted block.
 11. The method of claim 10, wherein further the filter coefficients are determined such that the data portion of the combined filter signal after projecting on the orthogonal complement on the space spanned by the composite codes of the base station specific scrambling code and the user specific codes is as close as possible to the zero space.
 12. The method of claim 1, wherein at least J base stations are transmitting the block to the terminal having at least an integer number M greater than or equal to J+1/(St*Sp) physical antennas, where St is a temporal oversampling factor and Sp is a polarization diversity.
 13. The method of claim 1, wherein the terminal constructs at least M=J+1 independent signals, and performs space-time combining J times, each of the combined filtered signals being a multi-user interference suppressed version of the signal transmitted from one of the basestations; and wherein the despreading is performed J times, to obtain J intermediate soft estimates of a user specific data symbol.
 14. The method of claim 13, further comprising: performing linear combining of the J despread combined signals so as to obtain a final soft estimate of a user specific data symbol; and determining a hard estimate for the user specific data symbol from the final soft estimate.
 15. In a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising: transmitting a block from a basestation to a terminal, the block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols, which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol; transmitting projection information, enabling projecting of a signal on the orthogonal complement of the space spanned by the composite codes of the basestation specific scrambling code and the user specific codes; and performing in the terminal: generating at least two independent signals comprising at least a channel distorted version of the transmitted block; combining with a combiner filter with filter coefficients determined by using the projection information being determined on the signals, thus obtaining a combined filtered signal; and despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific codes.
 16. The method of claim 15, wherein the user specific codes have length N and the projection information is determined by using the active user signals.
 17. The method of claim 15, wherein the user specific codes have length N and the projection information is determined by using a predetermined fixed set of maximum allowable active user signals.
 18. A receiver system for wireless communication, comprising: a receiver configured to receive a block comprising a plurality of chip symbols which are data symbols spread by using scrambling and spreading codes and at least one pilot symbol; at least one combining filter circuit configured to receive at least two independent signals derived from the received block and a circuit configured to determine coefficients of the combining filter circuit by using the pilot symbol, and wherein the combining filter circuit outputs a multi-user interference suppressed signal based on the coefficients; and at least one despreading circuit configured to despread the multi-user interference suppressed signal with one of the scrambling and spreading codes.
 19. A receiver system for wireless communication, comprising: a receiver configured to receive a block comprising a plurality of chip symbols which are data symbols spread by using scrambling and spreading codes, and projection information, enabling projecting of a signal on the orthogonal complement of the space spanned by scrambling and spreading codes; at least one combining filter circuit configured to input at least two independent signals derived from the received block and output a multi-user interference suppressed signal; and at least one despreading circuit configured to despread the multi-user interference suppressed signal with one of the scrambling and spreading codes.
 20. The system of claim 19, further comprising a circuit configured to determine the coefficients of the combining filter circuit's by using the projection information, and wherein the combining filter circuit outputs the multi-user interference suppressed signal based on the coefficients.
 21. A receiver system for wireless communication, comprising: J combining filter circuits, each of the combining filter circuits receiving a plurality of independent signals and outputing a multi-user interference suppressed signal a circuit configured to determine the coefficients of the combining filters circuit; a plurality of despreading circuits configured to despread the J multi-user interference suppressed signals; and a combining circuit configured to combine the J multi-user interference suppressed signals.
 22. In a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising: receiving a block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol; generating at least two independent signals based on the received block; and combining the at least two independent signals with a combiner filter having filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal.
 23. The method of claim 22, further comprising despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific spreading codes.
 24. A receiver system for wireless communication, comprising: means for receiving a block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol; means for generating at least two independent signals based on the received block; and means for combining the at least two independent signals with a combiner filter having filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal.
 25. The method of claim 24, further comprising means for despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific spreading codes. 